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Preface 



The subject of smart antennas is beginning to enjoy immense popularity 
due to the current exponential growth in all forms of wireless communi- 
cations and sensing. This rapid growth has been facilitated by advanc- 
ing digital signal processing hardware and also by the global interest in 
wideband wireless applications. Many in the wireless communication 
world are hoping to utilize smart antennas to boost capacities, expand 
bandwidths, increase signal-to-interference ratios, mitigate fading, and 
improve MIMO communications. Many others in the defense commu- 
nity are hoping to utilize smart antennas to help facilitate secure com- 
munications, direction finding, waveform diversity applications, MIMO 
radar, and multi-mission operations. Both groups are looking to use the 
benefits of smart antennas for similar reasons but are approaching the 
subject from different requirements and perspectives. This text has 
been written in the hopes of providing each group with a fundamental 
understanding of smart antennas without needing to tie the treatment 
to any one specific application. 

This author has an extensive background in signal processing, radar, 
communications, and electromagnetics. Having worked in industry and 
in academia, it has been difficult to find a smart antenna text that can 
be equally shared by both communities. In addition, only a few books in 
print even address the subject of smart antennas. Since smart antennas 
involve an amalgamation of many different disciplines, a background 
in each related area must be understood in order to appreciate this 
topic as a whole. Thus, one overriding goal of this text is to present the 
fundamentals of several different science and engineering disciplines. 
The intent is to show how all of these disciplines converge in the study 
of smart antennas. To understand smart antenna behavior, one must 
be versed in various topics such as electromagnetics, antennas, array 
processing, propagation, channel characterization, random processes, 
spectral estimation, and adaptive methods. Thus, the book lays a back- 
ground in each of these disciplines before tackling smart antennas in 
particular. 



xi 
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This text is organized into eight chapters. Chapter 1 gives the back- 
ground, motivation, justification, and benefits for the study of this 
important topic. Chapter 2 provides a summary of electromagnetics, 
reflection, diffraction, and propagation. These concepts help in under- 
standing the path gain factor, coverage diagrams, and fading. This ma- 
terial will be used later in order to understand the nature of multipath 
propagation and the phase relationship between different array ele- 
ments. Chapter 3 deals with general antenna theory including antenna 
metrics such as beamwidth, gain, principle plane patterns, and effective 
apertures. The Friis transmission formula is discussed because it aids in 
the understanding of spherical spreading and reception issues. Finally 
the specific behavior of dipoles and loops is addressed. The goal is to 
help the reader gain a basic understanding of how individual antenna 
elements affect the behavior of arrays. Chapter 4 addresses the sub- 
ject of antenna arrays. Array phenomenology is addressed in order to 
help in understanding the relationship of array shape to beam patterns. 
Linear, circular, and planar arrays are discussed. Array weighting or 
“shading” is explored in order to help the reader understand how array 
weights influence the radiation pattern. Specific arrays are discussed 
such as fixed beam arrays, beamsteered arrays, Butler matrices, and 
retrodirective arrays. This treatment of array behavior is invaluable 
in understanding smart antenna behavior and limitations. Chapter 5 
lays a foundation in random variables and processes. This is neces- 
sary because multipath signals and noise are characterized by random 
behavior. Also, channel delays and angles of arrival tend to be random 
variables. Thus, a minimum background in random processes must be 
established in order to understand the nature of arriving signals and 
how to process array inputs. Many smart antenna applications require 
the computation of the array correlation matrix. The topics of ergodicity 
and stationarity are discussed to help the reader understand the nature 
of the correlation matrix, prediction of the matrix, and what informa- 
tion can be extracted in order to compute the optimum array weights. 
It is assumed that students taking a smart antenna class are already 
familiar with random processes. However, this chapter is provided to 
help understand gird concepts, which will be addressed in later chap- 
ters. Chapter 6 addresses propagation channel characteristics. Such 
critical issues as fading, delay spread, angular spread, dispersion, and 
equalization are discussed. In addition, MIMO is briefly defined and 
addressed. If one understands the nature of multipath fading, one can 
better design a smart antenna that minimizes the deleterious effects. 
Chapter 7 discusses numerous different spectral estimation methods. 
The topics range from Bartlett beamforming to Pisarenko harmonic de- 
composition to eigenstructure methods such as MUSIC and ESPRIT. 
This chapter helps in understanding many useful properties of the 
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array correlation matrix as well as demonstrating that angles of ar- 
rival can be predicted with greater accuracy than the array resolution 
can allow. In addition, many of the techniques discussed in Chapter 7 
help aid in understanding adaptive methods. Chapter 8 shows the his- 
torical development of smart antennas and how weights can be com- 
puted by minimizing cost functions. The minimum mean-squared-error 
method lends itself to understanding iterative methods such as least 
mean squares. Several iterative methods are developed and discussed 
and their performance is contrasted with numerous examples. Current 
popular methods such as the constant modulus algorithm, sample ma- 
trix inversion, and the conjugate gradient method are explored. Lastly, 
a waveform diversity concept is discussed wherein a different waveform 
is applied to each array element in order to determine angles of arrival. 
This method has application to both MIMO communications and MIMO 
radar. 

Numerous MATLAB examples are given in the text, and most home- 
work questions require the use of MATLAB. It is felt that if students 
can program smart antenna algorithms in MATLAB, a further depth of 
understanding can be achieved. All MATLAB codes written and used 
for the completion of the book are available for use by the reader. It is 
intended that these codes can serve as templates for further work. 

Examples and Solutions on the Web 

The MATLAB codes are available at the website: http://books 
.mcgraw-hill.com. The codes are organized into three categories — 
examples, figures, and problems. There are codes produced in conjunc- 
tion with example problems in most chapters. The codes are denoted 
by sa ex# #.m. For example, Example 8.4 in Chapter 8 will have an 
associated MATLAB code labeled sa ex8 4.m. There are codes used to 
produce most of the figures in the book. These codes are denoted by 
sa Jig#_#.m. Thus, if one is interested in replicating a book figure or 
modifying a book figure, the code associated with that figure can be 
downloaded and used. There are also codes used to produce most of 
the homework solutions. These codes are denoted by sa prob# #.m. The 
students will have download access to the codes associated with the 
figures and examples. The instructor will have access to the codes as- 
sociated with figures, examples, and problem solutions. 

For Instructors 

This book can be used as a one-semester graduate or advanced 
undergraduate text. The instructor should allot two to three weeks for 
exploring Chapter 7 and three to four weeks for exploring Chapter 8. 
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The extent to which earlier chapters are treated depends upon the back- 
ground of the students. It is normally assumed that the prerequisites 
for the course are an undergraduate course in communications, an un- 
dergraduate course in advanced electromagnetics, and a basic course in 
random processes. The students may be able to take this course if they 
have a background in random processes, which is addressed in an un- 
dergraduate course in communications. It is assumed that all students 
have had an advanced engineering math course, which covers matrix 
theory including the calculation of eigenvalues and eigenvectors. 

It is my hope that this book will open new doors of understanding 
for the uninitiated and serve as a good resource material for the prac- 
titioner. 
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Chapter 

1 

Introduction 



This text has been written in response to the recent extreme interest 
in the rapidly growing field of smart antennas. Although some of the 
principles of smart antennas have been around for over forty years, new 
wireless applications demanding smart antenna technology are grow- 
ing exponentially. In addition, the latest algorithms that control smart 
antennas have matured to the point of being extremely effective in dy- 
namic and dispersive multipath environments. Thus, smart antennas 
are now becoming a critical adjunct for increasing the performance of 
a myriad of wireless applications. This new technology has a major 
role in all forms of wireless systems ranging from mobile cellular to 
personal communications services (PCS) to radar. This text will not ad- 
dress specific applications as much as it will introduce the reader to the 
basic principles which underlie smart antennas. A solid foundation is 
necessary in order to understand the full applicability and benefit of 
this rapidly growing technology. 

1 .1 What is a Smart Antenna? 

The term “smart antenna” generally refers to any antenna array, ter- 
minated in a sophisticated signal processor, which can adjust or adapt 
its own beam pattern in order to emphasize signals of interest and to 
minimize interfering signals. 

Smart antennas generally encompass both switched beam and beam- 
formed adaptive systems. Switched beam systems have several avail- 
able fixed beam patterns. A decision is made as to which beam to 
access, at any given point in time, based upon the requirements of 
the system. Beamformed adaptive systems allow the antenna to steer 
the beam to any direction of interest while simultaneously nulling 
interfering signals. The smart antenna concept is opposed to the fixed 

i 
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(a) (b) 

Figure 1 .1 (a) Traditional array ( b ) Smart antenna. 



beam “dumb antenna,” which does not attempt to adapt its radiation 
pattern to an ever-changing electromagnetic environment. In the past, 
smart antennas have alternatively been labeled adaptive arrays or digi- 
tal beamforming arrays. This new terminology reflects our penchant for 
“smart” technologies and more accurately identifies an adaptive array 
that is controlled by sophisticated signal processing. Figure 1.1 con- 
trasts two antenna arrays. The first is a traditional, fixed beam array 
where the mainlobe can be steered, by defining the fixed array weights 
w. However, this configuration is neither smart nor adaptive. 

The second array in the figure is a smart antenna designed to adapt to 
a changing signal environment in order to optimize a given algorithm. 
An optimizing criterion, or cost function, is normally defined based upon 
the requirements at hand. In this example, the cost function is defined 
as the magnitude of the error squared, |e| 2 , between the desired signal 
d and the array output y. The array weights w are adjusted until the 
output matches the desired signal and the cost function is minimized. 
This results in an optimum radiation pattern. 

1 .2 Why are Smart Antennas 
Emerging Now? 

The rapid growth in demand for smart antennas is fueled by two major 
reasons. First, the technology for high speed analog-to-digital convert- 
ers (ADC) and high speed digital signal processing is burgeoning at an 
alarming rate. Even though the concept of smart antennas has been 
around since the late 50s [1-3], the technology required in order to 
make the necessary rapid and computationally intense calculations has 
only emerged recently. Early smart antennas, or adaptive arrays, were 
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limited in their capabilities because adaptive algorithms were usually 
implemented in analog hardware. With the growth of ADC and dig- 
ital signal processing (DSP); what was once performed in hardware 
can now be performed digitally and quickly [4]. ADCs, which have res- 
olutions that range from 8 to 24 bits, and sampling rates approach- 
ing 20 Gigasamples per second (GSa/s), are now a reality [5]. In time, 
superconducting data converters will be able to sample data at rates 
up to 100 GSa/s [6]. This makes the direct digitization of most radio 
frequency (RF) signals possible in many wireless applications. At the 
very least, ADC can be applied to IF frequencies in higher RF fre- 
quency applications. This allows most of the signal processing to be 
defined in software near the front end of the receiver. In addition, DSP 
can be implemented with high speed parallel processing using field 
programmable gate arrays (FPGA). Current commercially available 
FPGAs have speeds of up to 256 BMACS. 1 Thus, the benefits of smart 
antenna integration will only flourish, given the exponential growth in 
the enabling digital technology continues. 

Second, the global demand for all forms of wireless communication 
and sensing continues to grow at a rapid rate. Smart antennas are 
the practical realization of the subject of adaptive array signal pro- 
cessing and have a wide range of interesting applications. These ap- 
plications include, but are not limited to, the following: mobile wire- 
less communications [7], software-defined radio [8, 9], wireless local 
area networks (WLAN) [10], wireless local loops (WLL) [11], mobile 
Internet, wireless metropolitan area networks (WMAN) [12], satellite- 
based personal communications services, radar [13], ubiquitous radar 

[14] , many forms of remote sensing, mobile ad hoc networks (MANET) 

[15] , high data rate communications [16], satellite communications [17], 
multiple-in-multiple-out (MIMO) systems [18], and waveform diversity 
systems [19]. 

The rapid growth in telecommunications alone is sufficient to justify 
the incorporation of smart antennas to enable higher system capacities 
and data rates. It is projected that the United States will spend over 
$137 billion on telecommunications in the year 2006. Global expendi- 
tures on telecommunications are rapidly approaching $3 trillion. 

1 .3 What are the Benefits 
of Smart Antennas? 

Smart antennas have numerous important benefits in wireless applica- 
tions as well as in sensors such as radar. In the realm of mobile wireless 
applications, smart antennas can provide higher system capacities by 



1 BMACS: Billion multiply accumulates per second. 
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Figure 1.2 (a) Sectorized (6) Smart Antennas. 



directing narrow beams toward the users of interest, while nulling other 
users not of interest. This allows for higher signal-to-interference ra- 
tios, lower power levels, and permits greater frequency reuse within the 
same cell. This concept is called space division multiple access (SDMA). 
In the United States, most base stations sectorize each cell into three 
120° swaths as seen in Fig. 1.2a. This allows the system capacity to 
potentially triple within a single cell because users in each of the three 
sectors can share the same spectral resources. Most base stations can 
be modified to include smart antennas within each sector. Thus the 120° 
sectors can be further subdivided as shown in Fig. 1.2 b. This further 
subdivision enables the use of lower power levels, and provides for even 
higher system capacities and greater bandwidths. 

Another benefit of smart antennas is that the deleterious effects of 
multipath can be mitigated. As will be discussed in Chap. 8, a con- 
stant modulus algorithm, which controls the smart antenna, can be 
implemented in order to null multipath signals. This will dramatically 
reduce fading in the received signal. Higher data rates can be realized 
because smart antennas can simultaneously reduce both co-channel in- 
terference and multipath fading. Multipath reduction not only benefits 
mobile communications but also applies to many applications of radar 
systems. 

Smart antennas can be used to enhance direction-finding (DF) tech- 
niques by more accurately finding angles-of-arrival (AO A) [20]. A vast 
array of spectral estimation techniques can be incorporated, which are 
able to isolate the AOA with an angular precision that exceeds the reso- 
lution of the array. This topic will be discussed in detail in Chap. 7. The 
accurate estimation of AOA is especially beneficial in radar systems for 
imaging objects or accurately tracking moving objects. Smart antenna 
DF capabilities also enhance geo-location services enabling a wireless 
system to better determine the location of a particular mobile user. 
Additionally, smart antennas can direct the array main beam toward 
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signals of interest even when no reference signal or training sequence 
is available. This capability is called blind adaptive beamforming. 

Smart antennas also play a role in MIMO communications sys- 
tems [18] and in waveform diverse MIMO radar systems [21, 22]. Since 
diverse waveforms are transmitted from each element in the transmit 
array and are combined at the receive array, smart antennas will play 
a role in modifying radiation patterns in order to best capitalize on 
the presence of multipath. With MIMO radar, the smart antenna can 
exploit the independence between the various signals at each array el- 
ement in order to use target scintillation for improved performance, to 
increase array resolution, and to mitigate clutter [19]. 

Many smart antenna benefits will be discussed in detail in Chaps. 7 
and 8. In summary, let us list some of the numerous potential benefits 
of smart antennas. 

■ Improved system capacities 

■ Higher permissible signal bandwidths 

■ Space division multiple access (SDMA) 

■ Higher signal-to-interference ratios 

■ Increased frequency reuse 

■ Sidelobe canceling or null steering 

■ Multipath mitigation 

■ Constant modulus restoration to phase modulated signals 

■ Blind adaptation 

■ Improved angle-of-arrival estimation and direction finding 

■ Instantaneous tracking of moving sources 

■ Reduced speckle in radar imaging 

■ Clutter suppression 

■ Increased degrees of freedom 

■ Improved array resolution 

■ MIMO compatibility in both communications and radar 



1.4 Smart Antennas Involve 
Many Disciplines 

The general subject of smart antennas is the necessary union between 
such related topics as electromagnetics, antennas, propagation, com- 
munications, random processes, adaptive theory, spectral estimation, 
and array signal processing. Figure 1.3 demonstrates the important 
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Figure 1 .3 Venn diagram relating various disciplines 
to smart antennas. 



relationship between each discipline. Many previous attempts have 
been made to explain smart antennas from the background of a single 
discipline; however, this myopic approach appeals only to small seg- 
ments of the engineering community and does not yield a full appreci- 
ation for this valuable subject. No single engineering discipline can be 
the sole province of this rapidly growing field. The subject of smart an- 
tennas transcends specific applications and thus merits a more global 
treatment. In order to fundamentally understand smart antennas, one 
must be versed in many varied and related topics. One could argue that 
some of the disciplines displayed in Fig. 1.3 can be merged to create a 
smaller list. However, the specialist, in each of these specific disciplines, 
brings a unique contribution to the general field of smart antennas. 
Thus, this book is an attempt to preview all of the above disciplines and 
to relate each of them to the subject as a whole. 

1 .5 Overview of the Book 

As has been mentioned, this text has been written in order to provide 
the reader with a broad and fundamental understanding of the subject 
of smart antennas. A foundation of basic principles is laid in each of the 
supporting disciplines which relate to smart antennas as a whole. These 
various disciplines are outlined chapter by chapter in this book. Elec- 
tromagnetics, antennas, and arrays are discussed in Chaps. 2 to 4. This 
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foundation is critical in better understanding the physics supporting 
smart antenna behavior. Random processes along with specific proba- 
bility distributions are discussed in Chap. 5. This aids in the under- 
standing of noise, channels, delay spread, angular spread, and channel 
profiles. Propagation channel characterization is discussed in Chap. 6. 
This helps the reader to understand the basics of multipath and fading. 
This will prove invaluable in understanding the limitations of smart 
antenna performance. AOA estimation is discussed in Chap. 7. This 
chapter explores the various techniques employed to estimate AOA and 
lays a foundation for understanding eigenstructure methods. Finally, 
smart antennas are discussed at length in Chap. 8. The history of smart 
antenna development is discussed. Numerous adaptive algorithms are 
explained and explored. 

The intent is that the interested reader can use the principles out- 
lined in this book in order to become very knowledgeable in the funda- 
mentals of smart antennas. In addition, numerous MATLAB examples 
are given. It is believed that a concept can be more fully understood if it 
can be modeled in software and solutions can be visualized. Numerous 
MATLAB script files are provided on a CD so that the student can un- 
derstand how to program these various algorithms and observe their 
performance. It is ultimately hoped, with this fundamental understand- 
ing, that the student can use this information as a spring-board for more 
advanced work in this fascinating field. The chapters are summarized 
as follows: 

Chapter 1: Introduction 

Chapter 2: Fundamentals of Electromagnetic Fields 
Chapter 3: Antenna Fundamentals 
Chapter 4: Array Fundamentals 

Chapter 5: Principles of Random Variables and Processes 
Chapter 6: Propagation Channel Characteristics 
Chapter 7: Angle-of-Arrival Estimation 
Chapter 8: Smart Antennas 
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2 

Fundamentals of 
Electromagnetic Fields 



The foundation for all wireless communications is based upon under- 
standing the radiation and reception of wireless antennas as well as 
the propagation of electromagnetic fields between these antennas. 
Regardless of the form of wireless communications used or the par- 
ticular modulation scheme chosen, wireless communication is based 
upon the laws of physics. Radiation, propagation, and reception can be 
explained through the use of Maxwell’s four foundational equations. 



2.1 Maxwell’s Equations 

It was the genius of James Clerk Maxwell 1 to combine the previous 
work of Michael Faraday, 2 Andre Marie Ampere, 3 and Carl Fredrick 
Gauss 4 into one unified electromagnetic theory. (Some very useful ref- 
erences describing electromagnetics basics are Sadiku [1], Hayt [2], and 



Games Clerk Maxwell (1831-1879): A Scottish born physicist who published his trea- 
tise on electricity and magnetism in 1873. 

2 Michael Faraday (1791-1867): An English born chemist and experimenter who con- 
nected time varying magnetic fields with induced currents. 

3 Andre Marie Ampere (1775-1836): A French born physicist who found that current 
in one wire exerts force on another wire. 

4 Carl Fredrick Gauss (1777-1855): A German born mathematical genius who helped 
to establish a worldwide network of terrestrial magnetism observation points. 



9 
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Ulaby [3].) Maxwell’s equations are given as follows: 



Faraday’s law 


- 3B 

V x E= 

dt 


(2.1) 


Ampere’s law 


- 9 5 - 

V X H= l-e/ 

dt 


(2.2) 


Gauss’s laws 


Jv-D = p 


(2.3) 




\ V • B= 0 


(2.4) 



where E — electric field intensity vector (V/m) 

D — electric flux density vector (C/m 2 ) 

H — magnetic field intensity vector (A/m) 

B — magnetic flux density vector (W/m 2 ) 

J — volume current density vector (A/m 2 ) 
p = volume charge density (C/m 3 ) 

The electric flux density and the electric field intensity are related 
through the permittivity of the medium as given by 

D = sE (2.5) 

The magnetic flux density and the magnetic field intensity are related 
through the permeability of the medium as given by 

B — fiH (2.6) 

where s — s r so — permittivity of the medium (F/m) 

eo = permittivity of free space = 8.85 x 10~ 12 F/m 
/i = /q./p) = permeability of the medium (H/m) 

/in = permeability of free space = 47r x 10~ 7 Fl/m 

With no sources present and expressing the fields as the phasors E s 
and H s , Maxwell’s equations can then be written in phasor form as 



V x E s = -jcoixH s (2.7) 

V x H s — (a + ja>e)E s (2.8) 

V • E s = 0 (2.9) 

V • H s = 0 (2.10) 



The phasor form of Maxwell’s equations assumes that the fields are 
expressed in complex form as sinusoids or can be expanded in sinusoids, 
that is, E — Re\E s e J '” 1 }; H = Re{// S e /V ' 7 [. Thus solutions stemming from 
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the use of Eqs. (2.7) to (2.10) must be sinusoidal solutions. One such 
solution is the Helmholtz wave equation. 

2.2 The Helmholtz Wave Equation 

We can solve for the propagation of waves in free space by taking the 
curl of both sides of Eq. (2.7) and eliminating H s by using Eq. (2.8). The 
result can be re-written as 

V x V x E s — —joop/a + jcos)E s (2.11) 

We also can invoke a well-known vector identity where V x V x E s = 
V( V • E s ) — V 2 E S . Since we are in free space, where no sources exist, the 
divergence of E s equals zero as given by Eq. (2.9). Equation (2.11) can 
thus be rewritten as 

V 2 E S - y 2 E s = 0 (2.12) 

where 

y 2 = jcopia + jtos ) (2.13) 

Equation (2.12) is called the vector Helmholtz 5 wave equation and y 
is known as the propagation constant. Since y is obviously a complex 
quantity, it can be more simply expressed as 

y=a+jft (2.14) 

where a is attenuation constant (Np/m) and ft is phase constant (rad/m). 

Through a simple manipulation of the real part of y 2 and the magni- 
tude of y 2 , one can derive separate equations for a and ft as given by 



a — &) 

N 



ps 

2 




ft — 00 



N 



ps 

2 




(2.15) 



(2.16) 



It can be seen that the attenuation constant in Eq. (2.15) and the 
phase constant in Eq. (2.16) are functions of the radian frequency to, 
constitutive parameters p and s, and also of the conductivity of the 
medium a. The term a /cos is typically referred to as the loss tangent. 



*Hermann Helmholtz (1821-1894): A German born physician who served in the Prus- 
sian army fighting Napolean. He was a self-taught mathematician. 
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Figure 2.1 a/p vs. loss tangent. 



When a medium has a loss tangent <.01, the material is said to be 
a good insulator. Indoor building materials, such as brick or concrete, 
have loss tangent values near .1 at 3 GHz. When the loss tangent is 
>100, the material is said to be a good conductor. Figure 2.1 shows a 
plot of a/p vs. the loss tangent. 

We can also solve for the propagation of magnetic fields in free space 
by taking the curl of both sides of Eq. (2.8) and substituting Eq. (2.7) to 
get the Helmholtz equation for H s as given by 

V 2 H s - y 2 H s = 0 (2.17) 

The propagation constant is identical to that given in Eq. (2.14). 

2.3 Propagation in Rectangular Coordinates 

The vector Helmholtz equation in Eq. (2.12) can be solved in any orthog- 
onal coordinate system by substituting the appropriate del (V) operator 
for that coordinate system. Let us first assume a solution in rectangular 
coordinates. Figure 2.2 shows a rectangular coordinate system relative 
to the earth’s surface. 

It is assumed that the z axis is perpendicular to the surface whereas 
the x and y coordinates are parallel. Let us also assume that the elec- 
tric field is polarized in the z-direction and is only propagating in the 
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Figure 2.2 Rectangular coordinate system 
relative to the earth. 



x -direction. Thus Eq. (2.12) can be furthered simplified. 

cPEy, o 

d J - Y 2 E xs = 0 (2.18) 

The solution is of the form 

E zs (x) = E 0 e~ yx + E ie yx (2.19) 

Assuming that the field propagates only in the positive x -direction 
and is finite at infinity then E\ must be equal to 0 giving 

E zs (x ) = E 0 e~ yx (2.20) 

We can revert the phasor of Eq. (2.20) back to the time domain by 
reintroducing e Jml . Thus 

E(x, t) = Re{E 0 e- yx e jwt z} = Ee{E 0 e- ax e j(a,t - px) z} 



or 



E(x,t) = Eoe “cos (cot — /3x)z (2.21) 

Figure 2.3 shows an example plot of the normalized propagating E-field 
at a fixed point in time. 

The attenuation constant in Eq. (2.21) is representative of a medium 
with an ideal homogeneous conductivity. In a more realistic radio wave 
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propagation model, the attenuation is further affected by the presence 
of atmospheric gases, clouds, fog, rain, and water vapor. This is es- 
pecially true for propagation frequencies above 10 GHz. Thus a more 
sophisticated model is necessary to more accurately represent the full 
signal attenuation. Three good references explaining atmospheric at- 
tenuation, due to other factors, are Collin [4] , Ulaby, Moore, Fung [5] , 
and Elachi [6]. Figure 2.4 shows the attenuation by the molecular res- 
onance of uncondensed water vapor. A resonance condition can be seen 
to occur at about 22 GHz. The equation for plotting Fig. 2.4 was taken 
from Frey [7] . 

2.4 Propagation in Spherical Coordinates 

We may also calculate the propagation of electric fields from an isotropic 
point source in spherical coordinates. The traditional wave equation 
approach is developed through vector and scalar potentials as shown in 
Collin [4] or Balanis [8]. However, a thumbnail derivation, albeit less 
rigorous, can be taken directly from Eq. (2.12) in spherical coordinates. 6 
This derivation assumes an isotropic point source. Figure 2.5 shows a 
spherical coordinate system over the earth. 



6 In the far-field E = — jcoA , thus the Helmholtz wave equation is of the same form for 
either E or A. 
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Figure 2.5 Spherical coordinate system 
relative to the earth. 



We will assume that the source is isotropic such that the electric field 
solution is not a function of (6, <f>). (It should be noted that we have made 
no assumption that the isotropic source is an infinitesimal dipole. This 
allows a simplification of the solution. ) Assuming that the electric field 
is polarized in the 0 -direction and is only a function of r, we can write 
Eq. (2.12) in spherical coordinates as 

d dr - Y 2 r l E es = 0 ( 2 . 22 ) 

For finite fields, the solution can be seen to be of the form 

E n e~ Yr 

E es (r ) = — (2.23) 

r 

As before, we can express the phasor of Eq. (2.23) in the time domain 
to get 

r* p—ar 

E(r,t)= — cos [cot — fir)9 (2.24) 

r 

The difference between Eq. (2.23) in spherical coordinates and 
Eq. (2.20) in rectangular coordinates is due to the fact that there is 
a point source producing the propagating wave thus giving rise to the 
1/r dependence. This factor is termed spherical spreading implying that 
since the radiation emanates from a point source, the field spreads out 
as if over the surface of a sphere whose radius is r. Since all finite length 
antennas are used to generate radio waves, all propagating electric far- 
fields undergo a spherical spreading loss as well as the attenuation loss 
due to factors discussed earlier. The solution depicted in Eqs. (2.23) and 
(2.24) is identical in form to the more classically derived solutions. The 
term Eq can be viewed as being frequency dependent for finite length 
sources. 

2.5 Electric Field Boundary Conditions 

All electric and magnetic field behavior is influenced and disrupted 
by boundaries. Boundaries interrupt the normal flow of propagating 
fields and change the field strengths of static fields. All material 
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discontinuities give rise to reflected, transmitted, refracted, diffracted, 
and scattered fields. These perturbed fields give rise to multipath con- 
ditions to exist within a channel. As the number of material disconti- 
nuities increases, the number of multipath signals increases. Boundary 
conditions on electric fields must be established in order to determine 
the nature of the reflection, transmission, or refraction between dielec- 
tric media. Scattering or diffraction conditions are accounted for by 
different mechanisms. These will be discussed in Sec. 2.8. 

Two of Maxwell’s equation, in integral form, can be used to establish 
electric field boundary conditions. These are the conservation of energy 
as given by 



j E ■ dl = 0 

and the conservation of flux as given by 

j) D • dS = Q e , 



(2.25) 



(2.26) 



Equation (2.25) can be applied to find tangential boundary conditions 
(E t ) and Eq. (2.26) can be applied to find normal boundary conditions 
( D n ). Let us recast the electric field strength and the electric flux density 
as having tangential and normal components relative to the boundary. 

E = E t + E„ (2.27) 

D = D t + D n (2.28) 



Figure 2.6 shows the boundary between two media and the correspond- 
ing tangential and normal electric fields on each side of the boundary. 

Applying Eq. (2.25) to the loop shown in Fig. 2.6, and allowing the loop 
dimensions to become very small relative to the radii of curvature of 
the boundary, we obtain the following simplification of the line integral: 

A h A li 

E t2 At - E n2 — - E nl — - E tl M = 0 (2.29) 




Figure 2.6 Dielectric boundary with E fields. 
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Figure 2.7 Dielectric boundary with electric flux 
density. 

Allowing the loop height Ah — »■ 0, Eq. (2.29) becomes 

E t i = E t 2 (2.30) 

Thus, tangential E is continuous across the boundary between two 
dielectrics. 

Figure 2.7 shows the boundary between two media and the corre- 
sponding tangential and normal electric flux densities on each side of 
the boundary. The boundary surface has a surface charge density p s . 

Applying Eq. (2.26) to the cylindrical closed surface shown in Fig. 2.7, 
and allowing the cylinder dimensions to become very small relative to 
the radii of curvature of the boundary, we obtain the following simpli- 
fication of the surface integral: 

D n2 As - As = p s As 



or 



D n 2-D n i = p s (2.31) 

Thus, normal D is discontinuous across a material boundary by the 
surface charge density at that point. 

We can apply the two boundary conditions given in Eqs. (2.30) and 
(2.31) to determine the refraction properties of two dissimilar dielec- 
tric materials. Let us assume that the surface charge at the boundary 
between the two materials is zero (p s = 0). Let us also construct a sur- 
face normal h, pointing into region 2, as shown in Fig. 2.8. Then E\ and 




Figure 2.8 D and E at a dielectric boundary. 
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D] are inclined at an angle 0] with respect to the surface normal. Also, 
Z ?2 and D -2 are inclined at an angle O 2 with respect to the surface normal. 
Applying the boundary condition in Eq. (2.30) we get 

Ei sin 0i = E t \ — E t 2 = E 2 sin 0 2 



or 



Ei sin 0i = E 2 sin 0 2 (2.32) 

In the same way, we can apply a similar procedure to satisfy the bound- 
ary conditions of Eq. (2.31) to yield 

S 1 E 1 cos 0i = D n 1 = D n2 = e 2 E 2 cos 0 2 



or 



£l-Z?i COS 01 = S 2 E 2 COS 62 



(2.33) 



Dividing Eq. (2.32) by Eq. (2.33) we can perform simple algebra to obtain 
a relationship between the two angles of the corresponding E fields. 



tan 0i e r i 

tan 0 2 s ,-2 



(2.34) 



Example 2.1 Two semi-infinite dielectrics share a boundary in the z = 0 
plane. There is no surface charge on the boundary. For z < 0, s r i = 4. For 

z > 0, e r 2 = 8. 

If Oi — 30°, what is the angle 0 2 ? 

Solution Let this problem be illustrated in Fig. 2.9. 

Using Eq. (2.34) it can be found that 

0 2 = tan -1 tantq^ = 49.1° 



Example 2.2 Two semi-infinite dielectrics share a boundary in the z = 0 
plane. There is no surface charge on the boundary. For z < 0, e r i = 4. For 




Figure 2.9 For Example 2.1. 
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z > 0, s r 2 = 8. Given that Ei = 2x + 4 y + 6 z, what is the electric field in 
region 2? 

Solution We can use Eq. (2.34) by finding the angle 0\ from the equation for E\ . 

However, it would be simpler to apply the boundary conditions of Eqs. (2.30) 
and (2.31). Thus using Eq. (2.30) 



Ef l — 2x + 4y — E t 2 



Also, D n 2 = e r2 e 0 , E n2 = 8e 0 , E n2 z and D nl = s r \ e 0 , E nl = 4fi 0 (6z). Thus 
using Eq. (2.31) E n 2 = 3 yielding 

E 2 = 2x + 4y + 3 z 



2.6 Magnetic Field Boundary Conditions 

The magnetic field boundary conditions are duals of the boundary con- 
ditions for electric fields. The remaining two Maxwell’s equations, in 
integral form, can be used to establish these magnetic boundary condi- 
tions. These are Ampere’s circuital law as given by 



H dl — 1 



and the conservation of magnetic flux as given by 



J B dS= 0 



(2.35) 



(2.36) 



Equation (2.35) can be applied to find tangential boundary conditions 
(H t ) and Eq. (2.36) can be applied to find normal boundary conditions 
( B n ). Let us recast the magnetic field intensity and the magnetic flux 
density as having tangential and normal components relative to the 
magnetic boundary. 



H = H t + H n (2.37) 

B = B t + B n (2.38) 

Figure 2. 10 shows the boundary between two media and the correspond- 
ing tangential and normal magnetic fields on each side of the boundary. 
In addition, a surface current density K flows along the boundary. 

Applying Eq. (2.37) to the loop shown in Fig. 2.10, and allowing the 
loop dimensions to become very small relative to the radii of curvature of 
the boundary, we obtain the following simplification of the line integral: 
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Figure 2.10 Magnetic boundary with E fields. 



Allowing the loop height Ah -»• 0, Eq. (2.39) becomes 

H a - H t 2 = K (2.40) 

Thus, tangential H is discontinuous across the boundary between two 
magnetic materials. 

Figure 2.11 shows the boundary between two media and the corre- 
sponding tangential and normal magnetic flux densities on each side 
of the boundary. The boundary has no corresponding magnetic surface 
charge because magnetic monopoles do not exist. 

Applying Eq. (2.36) to the cylindrical closed surface shown in Fig. 2.11, 
and allowing the cylinder dimensions to become very small relative to 
the radii of curvature of the boundary, we obtain the following simpli- 
fication of the surface integral: 



B„ 2 As - B n i As = 0 



or 



B a 2 = Bn 1 (2.41) 

Thus, normal B is continuous across a magnetic material boundary. 

We can apply the two boundary conditions given in Eq. (2.41) to 
determine the magnetic refractive properties of two dissimilar magnetic 
materials. Let us assume that the surface current density at the bound- 
ary between the two materials is zero ( K — 0). Performing a similar 




Figure 2.11 Dielectric boundary with electric flux 
density. 
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operation as in Sec. 2.5, we can use simple algebra to obtain a relation- 
ship between the two angles of the corresponding flux lines such that 



tan 0i 
tan (>2 



Mrl 

Mr2 



(2.42) 



Examples 2.1 and 2.2 can be applied to magnetic fields with the same 
corresponding values for the relative permeabilities achieving the same 
results. This is not surprising since electric and magnetic fields and 
electric and magnetic media are duals of each other. 



2.7 Planewave Reflection 
and Transmission Coefficients 

Multipath signals are the consequence of the transmitted signal 
reflecting, transmitting, and diffracting from various structures along 
the route to the receiver. In this section we will deal exclusively with 
the reflection and transmission of planewaves. One aspect of calculating 
each multipath term is being able to predict the reflection and transmis- 
sion through various materials. The boundary conditions of Eqs. (2.30) 
and (2.40) can be invoked to allow us to determine the reflection and 
transmission coefficients. The simplest case is to predict the reflection 
and transmission across a planar boundary at normal incidence. The 
details of the derivation can be found in Sadiku [1]. 

2.7.1 Normal incidence 

Figure 2.12 shows a plane wave normally incident upon a planar ma- 
terial boundary. 

Ei s and H ls symbolize the incident fields, in phasor form, propagat- 
ing in the positive z-direction. E rs and H rs symbolize the reflected fields 
propagating in the minus z-direction. E ts and H ts symbolize the trans- 
mitted fields propagating in the positive z-direction. The exact expres- 
sions for the E and II fields are given by the following expressions: 



Material 1 
Hh Eh O'! 
“E is 

H is 6— 




Material 2 
/^ 2 ? ^ 2 ’ 




Z 



Figure 2.12 Plane wave normally incident on a material 
boundary at z = 0. 
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Incident fields: 



Reflected fields: 



Transmitted fields: 



E is (z) = E io e VlZ x 

H is (z) = — e ~ nz y 
>?1 



E rs (z ) = E r0 e YlZ x 

H rs (z) = e wz y 

>?1 



E ts (z) = E t0 e V2Z x 

H ts (z ) = — 

'12 



m = 



>72 



/ V 1 




Si 




1-7 


Ql 

(OS\ 


/ /£ 2 




S2 





(2.43) 

(2.44) 

(2.45) 

(2.46) 

(2.47) 

(2.48) 



where the intrinsic impedances are given by 

= intrinsic impedance of medium 1 



1-7 — 

J 01S2 



= intrinsic impedance of medium 2 



Obviously the intrinsic impedances are dependent upon the loss tan- 
gent as well as the propagation constants in both media. 



Assuming that there is no surface current at the boundary and uti- 
lizing the tangential boundary conditions of Eqs. (2.30) and (2.40), one 
can derive the reflection and transmission coefficients, respectively, as 

R = m ~ >n = \R\e j0R (2.49) 

V2 + Vi 

T = — — = \T\e je T (2.50) 

>72 + 'll 

Knowing the reflection and transmission coefficients R and T, one can 
determine the total electric field in regions 1 and 2. The total electric 
field in region 1 is given by 

E\ s = E{ s + E rs = Ejo |_e ]lZ + Re J1 Jx (2.51) 

whereas the total electric field in region 2 is given by 



i?2s = TEoe Y2Z x 



(2.52) 
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When a nonzero reflection coefficient exists and region 1 is lossless, a 
standing wave is established. This standing wave gives rise to an inter- 
ference pattern, which is a function of distance from the boundary. This 
interference is a trivial example of fading that occurs in many wireless 
applications. It is instructive to derive this standing wave envelope. 

The total field in region 1 can be reexpressed using the polar form 
of the reflection coefficient. If we assume that region 1 is lossless 
(i.e., o ' i = 0). Then 



E ls = Ei o [e~ jhz + \R\e j(hz+eR) ]x (2.53) 

Combining the real and imaginary parts in Eq. (2.53) we get 

E u = Ei o [ cos(jSiz) + \R\ cos( j}\z + Or) 

+ (|i?| sin(diz + Or) — sm(Piz))e^ nl2)1 \x (2.54) 

We may now convert the phasor of Eq. (2.54) into instantaneous time 
form 



Ei(z, t) = EioKcosiPiz) + |i?| cos( fiiz + Or)) cos cot 

— (|i?| sin(/hz + Or) — sin(diz)) sin&rf] (2.55) 

Since Eq. (2.55) contains two components that are in phase quadrature, 
we may easily find the magnitude to be given as 

\Eiiz)\ 

= E i0 xAcosf/hz) + \R\ cos(diz + Or)) 2 + (sin(/3i z) - \R\ sinf^iz + Or)) 2 

= E w Vl + \R\ 2 + 2 ]R\ cos(2^!2 + Or) (2.56) 

Equation (2.56) has extrema when the cosine term is either +1 or —1. 
Thus the maximum and minimum values are given as 

Iffllmax = Eio \/ 1 + |f?| 2 + 2|i?| = Ejo( 1 + \R\) (2.57) 

l^ilmin = Eioy/1 + l#l 2 - 2|i?| = E i0 ( 1 - \R\) (2.58) 

The standing wave ratio s is defined as the ratio of |^i| max /|^ 2 lmin- 

Example 2.3 A boundary exists between two regions where region 1 is free 
space and region 2 has the parameters fi 2 = Mo> s 2 — 4«q, and a 2 = 0. If 
Ei 0 = 1, use MATLAB to plot the standing wave pattern over the range 

—An < /)j z < 0. 
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Figure 2.13 Standing wave pattern for normal incidence 

(f/2 = JJi/2). 



Solution Solving for the reflection coefficient 




Using Eq. (2.56) and MATLAB, the standing wave pattern appears as shown 
in Fig. 2.13. 

Normal incidence is a special case of the more interesting oblique 
incidence. Oblique incidence will be discussed in the next section. 

2.7.2 Oblique incidence 

The oblique incidence case is considerably more complicated than the 
normal incidence case and an intensive derivation of the reflection and 
transmission coefficients can be seen in Sadiku [1], The oblique in- 
cidence reflection and transmission coefficients are called the Fresnel 
coefficients. Only the highlights are given in this discussion. 

Figure 2.14 depicts an incident field upon a boundary. It is assumed 
that both media are lossless. The electric field is parallel to the plane 
of incidence. The plane of incidence is that plane containing the sur- 
face normal and the direction of propagation. The angles O r , and 9 t 
are the angles of incidence, reflection, and transmission with respect to 
the surface normal (=be axis). By a careful application of the boundary 
conditions, given in Eqs. (2.30) and (2.40), one can determine two laws. 
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Figure 2.14 Parallel polarization 
reflection and transmission. 



The first is Snell’s law of reflection, which states that the angle of reflec- 
tion equals the angle of incidence. (This property is also termed specular 
reflection.) 



Q r = 6i (2.59) 

The second result is the consequence of the conservation of phase and 
is also called Snell’s law of refraction. 



fi sin Oi — f 2 sin 9 t (2.60) 

Parallel polarization. The incident field, for the parallel polarization 
case, is indicated in Fig. 2.14. The coordinate system is rotated from 
the coordinate system in Fig. 2.12 so as to indicate reflection from a 
horizontal surface. This is often the case for elevated antennas over a 
flat earth. This is a parallel-polarized field because the E field is in the 
y-z plane, which is the plane of incidence. 

The incident, reflected, and transmitted electric fields are given by 

E is = E 0 (cos Qiy + sin < 3 #) e - j My<z* 0 i- 2 '*» ( k ) (2.61) 

E rs = #i|£o(cos Qiy - sin e i z) e - jhly ^ (h+zwse ‘ ) (2.62) 

E ts = T u E 0 (cose t y + sinQ t z)e- jMysilie ‘- ZC0aet) (2.63) 

where the reflection and transmission coefficients are given as 

)? 2 COS 9 t - rn cos Qi 

K\\ = 



tl 2 cos Q t + i;i cos Qi 



(2.64) 
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and 



T\\ = 



2)72 cos Oi 

rj 2 cos O t + iii cos Of 



(2.65) 



The term cos O t in Eqs. (2.64) and (2.65) can be easily calculated using 
Eq. (2.60) to be 



cos O t = J 1 - sin 2 O t = til — ^ l£l sin 2 Oi (2.66) 

V M2£2 

Figure 2.15 shows a plot of the magnitude of the reflection and trans- 
mission coefficients in the case that both media are nonmagnetic and 
lossless. The permittivities are given respectively as si = eo and £2 = 
2e 0 , 8fio, 32£ 0 . 



Reflection coefficient 




Figure 2.15 Reflection and transmission coefficient magnitude for 
parallel polarization. 
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Figure 2.16 Perpendicular polar- 
ization reflection and transmis- 
sion. 



Perpendicular polarization. The incident field, for the parallel polariza- 
tion case, is indicated in Fig. 2.16. 

The incident, reflected, and transmitted electric fields are given by 

E is = E o xe~ Myshl0i - zcosei) (2.67) 

E rs = R L E 0 xe- jfll<ysin0 ' +zcos(k) (2.68) 

E ts = T ± E 0 xe~ jfi2{ysine ‘- zcose ‘ ) (2.69) 



where the reflection and transmission coefficients are given as 

112 COS Q t - 771 cos e t 

it j_ — 

rj 2 cos Ot + iji cos O t 



(2.70) 



and 



T± 



2/72 cos Oi 

112 cos Oi + in cos O t 



(2.71) 



Figure 2.17 shows a plot of the magnitude of the reflection and trans- 
mission coefficients in the case that both media are nonmagnetic and 
lossless. The permittivities are given as £i = eo and £2 = 2f; 0 , 8so, 32 £ 0 , 
respectively. 



2.8 Propagation Over Flat Earth 

Having discussed the planewave reflection coefficients for parallel and 
perpendicular polarization, we are now in a position to analyze the 
propagation of planewaves over flat earth. Even though the earth has 
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Reflection coefficient 




Figure 2.17 Reflection and transmission coefficient magnitude for 
perpendicular polarization. 



curvature, and this curvature dramatically affects long distance propa- 
gation, we will limit this discussion to short distances and assume that 
the earth is flat. This allows us to make some propagation generaliza- 
tions. This topic is a critical start in understanding general multipath 
propagation problems because the flat earth model allows us to include 
a second indirect path. This second path will produce interference ef- 
fects at the receiver. 

Let us consider isotropic transmit and receive antennas as shown in 
Fig. 2.18. The transmitting antenna is at height hi, and the receiving 
antenna is at height / 12 , and the two antennas are separated by a hori- 
zontal distance d. The reflection coefficient from the surface of the earth 
is R and can be approximated either by Eq. (2.64) or (2.70). It should 
be noted that R is complex for most earth reflections. 

The received signal at the receiver is composed of the direct path 
propagation as well as the reflected signal at the point y. Thus, the 
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composite signal, due to the direct and indirect paths is proportional to 
the following equation: 



e -jkn 



n 



p-jkr 2 

+ R- 



r 2 



(2.72) 



where k is the wavenumber as given by the dispersion relation 



k 2 =k 2 +k 2 +k 2 = p 2 (2.73) 

with k — co^fjle = 2i r /X 

The reflection coefficient R is normally complex and can alter- 
natively be expressed as R = \R\e~^ . Through some simple algebra, 
it can be shown that 



ri = s/d 2 + (h 2 - /ti) 2 (2.74) 

r 2 = sjd 2 + (h 2 + h x ) 2 (2.75) 



Factoring the direct path term out of Eq. (2.72), we get 

e -jhri 
ri 



1 + R 



r i 
— < 
r*2 



-jUn-r i) 



(2.76) 



The magnitude of the second term in Eq. (2.76) is called the path gain 
factor F. F is also similar to the height gain defined in Bertoni [9]. 
Thus 



F = 



1 + R-e- jk(r2 ~ ri) 
r 2 



(2.77) 



This factor is analogous to the array factor for a two-element array 
separated by a distance of ‘2h\. In addition, the point of reflection 
at y is the solution to a simple algebra problem and is given as 
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y = dh\ j(h\ + h 2 ). Using the definition of the path gain factor, we can 
rewrite Eq. (2.76) as 

e -j kr i 

F (2.78) 

n 

If we make the assumption that the antenna heights hi, h 2 <?C n , r 2 , 
we can use a binomial expansion on Eqs. (2.74) and (2.75) to simplify 
the distances. 



n = \jd 2 + (h 2 - hi) 2 « d + ^ 

2d 

r 2 = \Jd 2 + (h 2 + hi) 2 d + ^ 



Consequently, the difference in path lengths is given as 

2h\h 2 
r 2 -n = — — 



(2.79) 

(2.80) 



(2.81) 



We additionally also can assume that r\/r 2 ss 1 for long distances. Under 
these conditions, we have a shallow grazing angle at the point of reflec- 
tion. Thus, R — 1. By substituting R and Eq. (2.81) into the path gain 
factor, we now can get 



F= 2 



. kh\h 2 
sin — - — 
d 



= 2 



sin 



2nh\ h 2 
d X 



(2.82) 



Figure 2.19 shows a typical plot of F for a range of values of h 2 /X where 
hi — 5 m ,d= 200 m. It is clear that the received signal can vary between 




Figure 2.19 Path gain factor. 
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zero and twice the direct path signal strength due to constructive and 
destructive interference created by the indirect path. The path gain 
factor can also be used to create a coverage diagram. The details of 
which can be found in [4] . 

2.9 Knife-Edge Diffraction 

In addition to received fields being disrupted by reflection from the 
ground, additional propagation paths can be created by diffraction from 
hills, buildings, and other objects. These structures may not be posi- 
tioned at angles so as to allow specular reflection; however, they might 
provide for a diffraction term in the total received field. Figure 2.20 
shows a hill of height h located between the transmit and receive 
antennas. This hill can be modeled as a half-plane or knife edge. It is 
assumed that there is no specular reflection from the hilltop. It is also 
assumed that the hill blocks any possible reflections from the ground, 
which arrive at the receiving antenna. Thus, the received field is only 
composed of the direct path and the diffraction path terms. h c is the 
clearance height from the knife-edge to the direct path. h c < 0 corre- 
sponds to the knife-edge being below the line of sight and therefore 
two propagation paths exist as shown in the figure. When h c > 0 the 
knife-edge obstructs the direct path. Thus only the diffraction term 
is received. d\ and d-i are the respective horizontal distances to the 
knife-edge plane (d = d\ + dz). The diffracted field can allow for a 
received signal even when line of sight is not possible. If the receive 




Figure 2.20 Knife-edge diffraction from a hill. 




More Books Visit : www.iqbalkalmati.blogspot.com 



32 Chapter Two 



antenna has no direct path to the transmit antenna, the receive antenna 
is said to be in the shadow region. If the receiver is in the line of sight 
of the transmitter, it is said to be in the lit region. The derivation of the 
solution for the diffracted field can be found in Collin [4] or in Jordan and 
Balmain [10] . It can be shown that the path gain factor due to diffraction 
is given by 



where 




- jnu2/2 du 



~H r 



H c 




(2.83) 



Thus, we can replace the path gain factor F in the flat-earth model with 
the path gain factor for diffraction. We can therefore rewrite Eq. (2.78) 
using Fd to become 

p-jfo' 

F d (2.84) 

r 

Figure 2.21 shows the path gain factor F d for a range of H, values. It 
can be seen that when the knife-edge is below the line of sight (h c < 0), 
there is an interference pattern as the direct path and the diffracted 
path phase in and out. However, when the knife-edge is above the line 




Figure 2.21 Path gain factor for knife-edge diffraction. 
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of sight (h c > 0) no direct path exists and the field quickly diminishes 
deeper into the shadow region. In the case where h c — 0, the path gain 
factor is .5 showing that the received field is 6 dB down from the direct 
path alone. 
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Problems 

2.1 For a lossy medium (cr / 0) with the following constitutive parameters, 
p — 4/xq, £ = 2so, o/a>s = 1, f = 1 MHz, find a and ft. 

2.2 For a lossy material such that p — 6/j.q , s — so- If the attenuation constant 
is 1 Np/m at 10 MHz, find 

(a) The phase constant f 

( b ) The loss tangent 

(c) The conductivity a 

id) The intrinsic impedance 

2.3 Use MATLAB to plot the ratio a/f for the range of .01 < a/coe < 100 with 
the horizontal scale being log base 10. 

2.4 Using Eq. (2.21), if p = pq, e = 4 sq, o/cve = 1, f = 100 MHz, how far 
must the wave travel in the z direction before the amplitude is attenuated 
by 30 percent? 

2.5 Two semi-infinite dielectrics share a boundary in the z = 0 plane as 
depicted in Fig. P2.1. There is no surface charge on the boundary. For z < 0, 
e rl = 2. For z > 0, e,. 2 = 6. If — 45°, what is the angle 02? 
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Figure P2.1 For Prob. 2.5. 



2.6 Two semi-infinite dielectrics share a boundary in the z = 0 plane. There is 
no surface charge on the boundary. For z < 0, s r i = 2. For 2 > 0, e r 2 = 4. Given 
that Ei = 4x + 2y + 3z in region 1, find the electric field in region 2(FJ 2 )? 

2.7 Two semi-infinite magnetic regions share a boundary in the z = 0 plane. 
There is no surface current density K on the boundary. For z < 0, // r] = 4. For 
z > 0, p r 2 = 2. Given that //] = 4x + 2y + 3z in region 1, what is the magnetic 
flux density in region 2 ( B 2 )? 

2.8 A plane wave normally incident on a material boundary as indicated in 
Fig. P2.2. Both regions are nonmagnetic. e rl = 2, s r2 = 4. If the loss tangent in 
region 1 is 1.732 and the loss tangent in region 2 is 2.8284, find the following: 

(a) The intrinsic impedance in region 1 

( b ) The intrinsic impedance in region 2 
(c j The reflection coefficient R 

( d ) The transmission coefficient T 

2.9 Modify Prob. 2.8 such that the loss tangent in region 1 is zero. If E{q = 1, 
use MATLAB to plot the standing wave pattern for the range —2 n < /iz < 0. 

2.10 In the oblique incidence case with parallel polarization, as depicted in 
Fig. 2.14, both regions are nonmagnetic and e,i = 1, e r2 = 6. If the loss tangent 



Material 1 
jU 1- e l> 




Material 2 
f-l 2 , ^ 2 ’ 





Z 



Figure P2.2 Plane wave at normal incidence on a boundary. 
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in region 1 is 1.732 and the loss tangent in region 2 is 2.8284, 9i = 45°. Find 
the following: 

(a) The intrinsic impedance in region 1 

( b ) The intrinsic impedance in region 2 

(c) The reflection coefficient 

(d) The transmission coefficient T\\ 

2.1 1 Repeat Prob. 2.10 but for the perpendicular polarization case as depicted 
in Fig. 2.16. 

2.12 Region 1 is free space and region 2 is lossless and nonmagnetic with 
e r 2 = 2, 8, 64. Use MATLAB and superimpose plots for all three dielectric 
constants as indicated below for a range of angles 0 < 9 < 90° 

(a) R|| 

(b) Til 

(c) Rl 

(d) T± 

2.13 For two antennas over flat earth, derive the equation for the specular 
reflection point y in terms of d, hi, and /i 2 - 

2.14 Using Fig. 2.18 with hi — 20 m, /12 = 200 m, and d = 1 km, R = — 1, 
use MATLAB to plot the exact path gain factor [Eq. (2.77)] for 200 MHz < f < 
400 MHz. Assume free space. 

2.1 5 Repeat Prob. 2.14 but allow f = 300 MHz and plot F for the range 500 111 
< d < 1000 m. Assume free space. 

2.16 Repeat Prob. 2.14 but allow the incident field be perpendicular to the 
plane of incidence such that E is horizontal to the ground. Allow the ground to 
be nonmagnetic and have a dielectric constant s r — 4. 

(a ) What is the Fresnel reflection coefficient? 

(b) Plot the path gain factor F for 200 MHz < f < 400 MHz. 




Figure P2.3 Hill between two antennas. 
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2.17 Use MATLAB and plot the path gain factor due to diffraction (F,i) found 
in Eq. (2.82) for the range of values —8 < H c < 8. 

2.18 For the hill between two antennas as shown in Fig. P2.3 where h\ = 
150 m, /12 = 200 m, d\ = 300 m, cL 2 = 700 m, d = 1 km, h = 250 m, f = 300 MHz. 

(a) What is the direct path length ri? 

( b ) What is the indirect path length 

(c) What is the clearance height h c (be sure your sign is right)? 

(i d ) What is the integration height H c ? 

(e) Using Eq. (2.84), what is the magnitude of the received signal if 
E 0 = 1 V/m? 
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Chapter 

3 

Antenna Fundamentals 



The design and analysis of smart antennas assumes a working knowl- 
edge of many different but related disciplines. A smart antenna designer 
must rely on such disciples as (1) random processes, (2) electromag- 
netics, (3) propagation, (4) spectral estimation methods, (5) adaptive 
techniques, and (6) antenna fundamentals. Especially though, smart 
antenna design is heavily dependent upon a basic knowledge of antenna 
theory. It is critical to match the individual antenna behavior with the 
overall system requirements. Thus, this chapter will cover the relevant 
antenna topics such as the near and far fields that surround anten- 
nas, power densities, radiation intensities, directivities, beamwidths, 
antenna reception, and fundamental antenna designs including dipoles 
and loops. It is not necessarily required that one have an extensive 
antenna background, in order to understand smart antennas, but it 
would be wise to review some of the literature referenced in this chap- 
ter. The foundation for much of the material in this chapter is taken 
from the books of Balanis [1], Kraus and Marhefka [2], and Stutzman 
and Thiele [3] . 



3.1 Antenna Field Regions 

Antennas produce complex electromagnetic (EM) fields both near to and 
far from the antennas. Not all of the EM fields generated actually radi- 
ate into space. Some of the fields remain in the vicinity of the antenna 
and are viewed as reactive near fields; much the same way as an in- 
ductor or capacitor is a reactive storage element in lumped element cir- 
cuits. Other fields do radiate and can be detected at great distances. An 
excellent treatment of the antenna field regions is given in [1] and [2]. 
Figure 3.1 shows a simple dipole antenna with four antenna regions. 



37 
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Figure 3.1 Antenna field regions. 



The boundaries defined in the regions above are not arbitrarily 
assigned but are the consequence of solving the exact fields surrounding 
a finite length antenna. 

The four regions and their boundaries are defined as follows: 

Antenna region That region which circumscribes the physical an- 
tenna boundaries is called the antenna region as defined by 




Reactive near-field region That region which contains the reactive 
energy surrounding the antenna is called the reactive near-field 
region. It represents energy stored in the vicinity of the antenna 
which does not radiate and thus is seen in the imaginary part of 
the antenna terminal impedance. This region is defined by 

R< 0.62 

Fresnel region ( radiating near field) That region which lies between 
the reactive near-field and the Fraunhofer far-field is the Fresnel 
region or radiating near-field region. The antenna field radiates in 
this region but the radiation pattern changes with distance from the 
phase center owing to the fact that the radiated field components 
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diminish at different rates. This region is defined by 



0.62 




< R < 



2 L 2 



Fraunhofer region (far-field) That region which lies beyond the near- 
field and where the radiation pattern is unchanging with distance 
is defined as the Fraunhofer region. This is the principle region of 
operation for most elemental antennas. This region is defined by 



For practical purposes, this text generally assumes antenna radia- 
tion in the Fresnel or Fraunhofer regions. If array element coupling 
is to be considered, the reactive near-field region must be accounted 
for in all calculations. 



3.2 Power Density 

All radiated antenna fields carry power away from the antenna which 
can be intercepted by distant receiving antennas. It is this power which 
is of use in communication systems. As a trivial example, let us assume 
that the propagating phasor fields, generated by a point source isotropic 
antenna, are given further and are expressed in spherical coordinates. 

E es = —e~ ]kr e V/m (3.1) 

r 

H (I)S = —e~ jkr i> A/m (3.2) 

;;r 

where ;/ is the intrinsic impedance of the medium. 

If the intrinsic medium is lossless, the time-varying instantaneous 
fields can be easily derived from Eqs. (3.1) and (3.2) to be 

E(r, t) = Re(—e j(01t - kr) e\ = — cos (cot - kr)0 (3.3) 

[ r Jr 

H(r, t) = Re (— = — cos (cat - kr)$ (3.4) 

[rjr J i]r 

The electric field intensity in Eq. (3.3) is seen to radiate in the positive 
r direction and is polarized in the positive 6 direction. The magnetic field 
intensity in Eq. (3.4) is seen to radiate in the positive r direction and is 
polarized in the positive </> direction. Figure 3.2 shows the field vectors 
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Figure 3.2 EM field radiating 
from a point source. 



in spherical coordinates. These far-fields are mutually perpendicular 
and are tangent to the sphere whose radius is r. 

The Poynting vector, named after J. H. Poynting, 1 is the cross product 
of the electric and magnetic field intensities and is given as 

P — E x H W/m 2 (3.5) 

The cross product is in the right-handed sense and gives the direction 
of propagation of the power density. The Poynting vector is a measure of 
the instantaneous power density flow away from the source. By substi- 
tuting Eqs. (3.3) and (3.4) into Eq. (3.5) and using a simple trigonometric 
identity we get 

— E 2 

P{r, t) = — [1 + cos(2&>£ — 2 kr)]r (3.6) 

2 r)r z 

The first term in Eq. (3.6) represents the time average power density 
radiating away from the antenna whereas the second term represents 
an instantaneous ebb and flow. By taking the time average of Eq. (3.6) 
we can define the average power density. 

1 f T - 

W(r)=- P(r, t)dt 
f Jo 

E 2 

= — V W/m 2 (3.7) 

2 i]r z 

The calculation of the time average power density is equivalent to per- 
forming a calculation in phasor space. 

W(r, 6, <p) = *R e(E s x H s *) = ^\E S \ 2 (3.8) 

Z ' Zi] 



^ohn Henry Poynting (1852-1914). A student of Maxwell who derived an equation to 
show energy flow from EM fields. 
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Figure 3.3 Power density from an 
isotropic point source. 



Equation (3.8) represents the average power density flow away from 
the isotropic antenna and thus is not a function of 0 or <p. For practical 
antennas, the power density is always a function of r and at least one 
angular coordinate. 

In general, the power density can be represented as a power flow 
through a sphere of radius r as shown in Fig. 3.3. 

The total power radiated by an antenna is found by the closed surface 
integral of the power density over the sphere bounding the antenna. 
This is equivalent to applying the divergence theorem to the power 
density. The total power is thus given by 

P tot = j)J W ■ cLs — J J W r (r, 0, <p)r 2 sin OdOdcp 




W r (r, 0, <p)r 2 dQ, W 



(3.9) 



where d£l = sin OdOdcp — element of solid angle or the differential solid 
angle. 

In the isotropic case where the power density is not a function of 0 or 
cp, Eq. (3.9) simplifies to become 



Ptot — 




W r (r)r 2 sin OdOdcp — A7tr 2 W r {r) 



or conversely 



W r (r) = 



Ptot 

Artr 2 



(3.10) 



(3.11) 



Thus, for isotropic antennas, the power density is found by uniformly 
spreading the total power radiated over the surface of a sphere of 
radius r. Thus the density diminishes inversely with r 2 . It is also inter- 
esting to observe that the power density is only a function of the real 
power (Ptot) delivered to the antenna terminals. The reactive power does 
not contribute to the radiated fields. 
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Figure 3.4 Pattern plots for two 
ranges ri and r 2 . 



Example 3.1 Find the total power radiated by an isotropic antenna whose 
electric field intensity is given as 

E s = -e~ Jkr 3 V/m 

r 



Solution Equation (3.7) shows the power density to be 

,2 



W(r) = 5 3 X „ 10 3 f W/m z 



Substituting this result into Eq. (3.9) yields the total power to be 

Ptot = 66.7 mW 



Example 3.2 Plot the power density vs. angle for the far electric field at two 
different distances ?q and r^. Where iq = 100 m and r<i = 200 m. Given that 



E s 

H s 



100 sin 9 „ 

e~ Jla 9 

r 

100 sin 9 _ : hr ~ 

e Jk <f> 

rjr 



Solution By using Eq. (3.8), the power density magnitude can easily be found 
to be 



W(r, 9) = 



13.3 sin 2 9 

—re- 



using MATLAB code sa_ex3_2 .m and the polar plot command we get a 
pattern plot for both distances as shown in Fig. 3.4. 



3.3 Radiation Intensity 

The radiation intensity can be viewed as a distance normalized power 
density. The power density in Eq. (3.8) is inversely proportional to the 
distance squared and thus diminishes rapidly moving away from the 
antenna. This is useful in indicating power levels but is not useful in 



More Books Visit : www.iqbalkalmati.blogspot.com 



Antenna Fundamentals 43 



indicating distant antenna patterns. The radiation intensity removes 
the 1/r 2 dependence thus making far-held pattern plots distance inde- 
pendent. The radiation intensity is thus defined as 

U(9, <P) = r 2 1 W (r, 6,<P) | = r 2 W r (r, 6, </>) (3.12) 



It is easy to show Eq. (3.12) can alternatively be expressed by 

r 2 - 

me,<t> ) = — \E s (r, 9,<p)\ 2 

At) 

2 

= r !L-\H s (r,e,<P )\ 2 (3.13) 

This definition also simplifies the calculation of the total power radia- 
ted by the antenna. Equation (3.9) can be repeated substituting the 
radiation intensity. 



p2n 



Ptot — 



W r (r, 9, <p)r 2 sin 9d9d4> 



Jo Jo 

c2n nn 



me, 0 ) do. w 



(3.14) 



Jo Jo 



The general radiation intensity indicates the radiation pattern of the 
antenna in three dimensions. All anisotropic antennas have a nonuni- 
form radiation intensity and therefore a nonuniform radiation pattern. 
Figure 3.5 shows an example of a three-dimensional pattern as dis- 
played in spherical coordinates. This antenna pattern or beam pattern 
is an indication of the directions in which the signal is radiated. In the 




Figure 3.5 Antenna 3-D pattern. 
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case of Fig. 3.5, the maximum radiation is in the 9 — 0 direction or 
along the z axis. 

Example 3.3 In the Fraunhofer region (far-field), a small dipole has an elec- 
tric field intensity given by 

- En sin 9 - 

E s (r, 9, <p) = e ~Jk > 9 

r 

What are the radiation intensity and the total power radiated by this 
antenna? 



Solution Using Eq. (3.13), the radiation intensity is given as, 



me, <p) 



Eq sin 2 9 

2 ij 



Using Eq. (3.14), the total power radiated is given as, 



Ptot = 0.011F 2 W 



3.4 Basic Antenna Nomenclature 

With an understanding of the derivation of the radiation intensity 
(U(9, </>)), we can now define some metrics that help define an antenna’s 
performance. 



3.4.1 Antenna pattern 

An antenna pattern is either a function or a plot describing the direc- 
tional properties of an antenna. The pattern can be based upon the 
function describing the electric or magnetic fields. In that case, the 
pattern is called a field pattern. The pattern can also be based upon 
the radiation intensity function defined in the previous section. In that 
case, the pattern is called a power pattern. The antenna pattern may not 
come from a functional description but also may be the result of antenna 
measurements. In this case the measured pattern can be expressed as 
a field pattern or as a power pattern. 

Figures 3.6a and b show a typical two-dimensional field pattern plot 
displayed in both rectangular coordinates and in polar coordinates. The 
mainlobe and sidelobes of the pattern are indicated. The mainlobe is 
that portion of the pattern which has maximum intended radiation. 
The sidelobes are generally unintended radiation directions. 

Figures 3.6a and b can be viewed as demonstrating a two- 
dimensional slice of what is typically a three-dimensional pattern. 
Figure 3.7 shows the same pattern displayed in three dimensions. The 
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o 





(a) (b) 

Figure 3.6 (a) Field pattern plot in rectangular coordinates. (6) Field pattern plot in 
polar coordinates. 



three-dimensional perspective is useful for illustration purposes but 
often antenna designers display two-dimensional plots in the principle 
planes of three-dimensional patterns. 

Example 3.4 Use MATLAB to produce a three-dimensional radiation 
pattern for a radiation intensity given by U(9) = cos 2 9. 

Solution Radiation patterns are normally calculated in spherical coordinates. 
That being understood, the patterns have corresponding x , y, and z coordinate 
values. We can simply use the coordinate transformation to transform points 
on the radiation intensity to (x, y, z ) coordinates. Thus 

x = U(9, cp) sin 9 cos cf) y = U{9, <f>) sing sin <f> z = U(9, cp) cos 9 




> 

y 



Figure 3.7 Field pattern plot in 
three dimensions. 
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MATLAB has a feature called ezmesh which allows us to easily plot a 
surface in three dimensions. The corresponding MATLAB commands are 

fx = inline ( 1 cos (theta) ~2 *sin ( theta) *cos (phi) 1 ) 
fy = inline ( 1 cos (theta) A 2*sin (theta) *sin (phi) 1 ) 
fz = inline ( 1 cos (theta) ~2*cos (theta) 1 ) 
figure 

ezmeshffx, fy, fz, [0 2*pi 0 pi], 100) 
colormap ( [0 0 0] ) 
axis equal 

set (gca, ' xdir 1 , ’ reverse 1 , 1 ydir ' , ' reverse 1 ) 

The plot from MATLAB code sa_ex3_4 .mis shown in Fig. 3.8. 

3.4.2 Antenna boresight 

The antenna boresight is the intended physical aiming direction of an 
antenna or the direction of maximum gain. This is also the central axis 
of the antenna’s mainlobe. In other words, it is the normally intended 
direction for maximum radiation. The boresight in Fig. 3.7 is central 
axis of the mainlobe which corresponds to the 2 -axis where 6 — 0 . 

3.4.3 Principal plane patterns 

The field patterns or the power patterns are usually taken as two- 
dimensional slices of the three-dimensional antenna pattern. These 

x = cos( Of sin( 0) cos (<j>) v = cos( Of sin( 9) si n( 0) z = cos (Of cos( 9) 




Figure 3.8 3-D radiation pattern. 
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Figure 3.9 Principle planes in 
spherical coordinates. 



slices can be defined in several different ways. One option is to plot 
the E- and H-plane patterns when the radiation is linearly polarized. 
In this case, the E-plane pattern is the plot containing the E field vector 
and the direction of maximum radiation. In the other case, the H-plane 
pattern is the plot containing the H field vector and the direction of max- 
imum radiation. It is most convenient to orient the antenna in spheri- 
cal coordinates such that the E- and H-planes correspond to the 9 and 
< p constant planes. These planes are called the azimuth and elevation 
planes, respectively. Figure 3.9 shows the spherical coordinate system 
with polarized fields and the azimuth and elevation planes parallel to 
these field vectors. 

3.4.4 Beamwidth 

The beamwidth is measured from the 3-dB points of a radiation pattern. 
Figure 3.10 shows a two-dimensional slice of Fig. 3.5. The beamwidth 
is the angle between the 3-dB points. Since this is a power pattern, the 
3-dB points are also the half power points. 




Figure 3.10 Half-power 
beamwidth. 
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In the case of field patterns instead of power patterns, the 3-dB points 
would be when the normalized pattern amplitude = = 0.707. 



3.4.5 Directivity 

The directivity is a measure of how directive an individual antenna 
is relative to an isotropic antenna radiating the same total power. In 
other words, the directivity is the ratio of the power density of an 
anisotropic antenna relative to an isotropic antenna radiating the 
same total power. Thus the directivity is given as 



D{9, <p) 



Wjo, <p) 

Ptot 
4jt r 2 



AnUiO, tp) 

Ptot 



(3.15) 



The directivity can be more explicitly expressed by substituting 
Eq. (3.14) into (3.15) to get 



D{0, tp) 



AnUiO, tp) 

ftfo me, J>) sin 0 dO dtp 



(3.16) 



The maximum directivity is a constant and is simply the maximum of 
Eq. (3.16). The maximum directivity is normally denoted by Dq. Thus, 
the maximum directivity is found by a slight modification of Eq. (3.16) 
to be 



A> = 



4?r t/ m ax 

StJoUiO, 0) si n OdOdtp 



(3.17) 



The directivity of an isotropic source is always equal to 1 since isotropic 
sources radiate equally in all directions and therefore are not inherently 
directive. 



Example 3.5 Find the directivity and the maximum directivity of the small- 
dipole radiation pattern given in Example 3.3. 



Solution Using Eq. (3.16) we find 



Did, tp) 



An sin 2 6 

fo^fo s ^ n3 QdOdtp 



1.5 sin 2 6 



As can intuitively be seen in the Example 3.5, the directivity is not affected 
by the radiation intensity amplitude but is only dependent upon the func- 
tional form of XJiO, <p). The scalar amplitude terms divide out. The maximum 
directivity is a constant and is simply given by Eq. (3.17) to be D 0 = 1.5. 

Plotting the directivity of an antenna is more useful than plotting the radi- 
ation intensity because the amplitude indicates the performance relative to 
an isotropic radiator irrespective of the distance. This is helpful in indicating 
not just the antenna pattern but also indicating a form of the antenna gain. 
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3.4.6 Beam solid angle 

The beam solid angle (£2 a) is that angle through which all of the an- 
tenna power radiates if its radiation intensity were equal to its maxi- 
mum radiation intensity (U max ). The beam solid angle can be seen by 
expressing Eq. (3.17) as 



where 



Da = 



4jt 



/oX C £ )sin ^ 



4 n 
£2a 



Qa — Beam solid angle = 




me, 0 ) 

ITmax 



sin 6d0d(p 



(3.18) 



(3.19) 



The beam solid angle is given in steradians where one steradian is 
defined as the solid angle of a sphere subtending an area on the sur- 
face of the sphere equal to r 2 . Thus there are 4 jt steradians in a sphere. 
The beam solid angle is the spatial version of the equivalent noise band- 
width in communications. An explanation of the noise equivalent band- 
width is given by Haykin [4] . 



3.4.7 Gain 

The directivity of an antenna is an indication of the directionality of 
an antenna. It is the ability of an antenna to direct energy in preferred 
directions. The directivity assumes that there are no antenna losses 
through conduction losses, dielectric losses, and transmission line mis- 
matches. The antenna gain is a modification of the directivity so as to 
include the effects of antenna inefficiencies. The gain is more reflective 
of an actual antenna’s performance. The antenna gain expression is 
given by 



G(9, <p) = eD{9, </>) (3.20) 

where e is the total antenna efficiency including the effects of losses and 
mismatches. The pattern produced by the gain is identical to the pattern 
produced by the directivity except for the efficiency scale factor e. 



3.4.8 Effective aperture 

Just as an antenna can radiate power in various preferred directions it 
can also receive power from the same preferred directions. This princi- 
ple is called reciprocity. Figure 3.11 shows transmit and receive anten- 
nas. The transmit antenna is transmitting with power Pi (Watts) and 
radiates a power density Wi (Watts/m 2 ). 
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Antenna 1 Antenna 2 

Figure 3.11 Transmit and receive antennas. 



The receive antenna intercepts a portion of the incident power density 
W\ thereby delivering power P 2 to the load. The receive antenna can be 
viewed as an effective aperture of area A e2 which captures a portion of 
the available power density. Thus, using Eqs. (3.15) and (3.20), we can 
write the received power as 



P 2 = A e2 W i = 



A e 2P\e\Di{9i, (pp 

Anrf 



W 



(3.21) 



where r\,0\, <p\ are the local spherical coordinates for antenna 1. 

If the antennas in Fig. 3.10 are reversed such that the receive antenna 
transmits and the transmit antenna receives, it can be shown that 



Pi = A el W 2 = 



A el P 2 e2D 2 (e2, (pp) 

Anri 



W 



(3.22) 



where r 2 , d 2 , </; 2 are the local spherical coordinates for antenna 2. 

The derivation is beyond the scope of this text but it can be shown 
from Eqs. (3.21) and (3.22) [1,2] that the effective aperture is related to 
the directivity of an antenna by 



l 2 X 2 

A e (0, (P) = —eD{0, cp) = —CKO, <p) (3.23) 

An An 



3.5 Friis Transmission Formula 

Harald Friis 2 devised a formula relating the transmit and received pow- 
ers between two distant antennas. We will assume that the transmit 
and receive antennas are polarization matched. (That is, the polariza- 
tion of the receive antenna is perfectly matched to the polarization cre- 
ated by the transmit antenna.) By substituting Eq. (3.23) into (3.21) 



2 Harald T. Friis (1893-1976): In 1946 developed a transmission formula. 
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ft 

■s'* 



Antenna 2 



Antenna 1 

z ‘ 



r = 1 km 






y 

h = 500 m 



Figure 3.12 Two antennas separated by a distance r. 



one can derive the following relationship: 

^ Gi(9i, </>i)G2(02> ) (3.24) 

If polarization must be taken into account, we can multiply Eq. (3.24) 
by the polarization loss factor (PLF) = pi • p2 1 2 where p\ and f >2 are the 
polarizations of antennas 1 and 2, respectively. An intensive treatment 
of polarization and the polarization loss factor can be found in ref. [2] . 

Example 3.6 Calculate the receive power P 2 a t antenna 2 if the transmit 
power P\ = 1 kW. The transmitter gain is G\{9\, <p\) = sin 2 (fh) and the 
receiver gain is G2(6*2, <p%) = sin 2 (0 2 ). The operating frequency is 2 GHz. Use 
the Fig. 3.12 to solve the problem. The y-z and y'-z' planes are coplanar. 



P2 = M 

P 1 \47rr 



Solution The geometry indicates 9i = 60° and 6*2 = 120°. Using Eq. (3.24) we 
have 

P2 = Pi ^4^:^ Gi(0i,0i)G2(02»02) 






= Pi ( -j— I sin 2 (6*i) sin 2 (# 2 ) 



= 10 ' 

= 80.1 nW 



(SS) sin2(6 °) sin2(12 °) 



3.6 Magnetic Vector Potential 
and the Far Field 

All antennas radiate electric and magnetic fields by virtue of charges 
accelerating on the antenna. These accelerating charges are normally 
in the form of an ac current. While it is possible to calculate distant 
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radiated fields directly from the antenna currents, in most cases it is 
mathematically unwieldy. Therefore an intermediate step is used where 
we instead calculate the magnetic vector potential. From the vector 
potential we can then find the distant radiated fields. 

The point form of Gauss’ law states that V • B = 0. Since the di- 
vergence of the curl of any vector is always identically equal to zero 
(V • V x A — 0), we then can define the B field in terms of the magnetic 
vector potential A. 

B = VxA (3.25) 

Equation (3.25) solves Gauss’ law. 

Since B and II are related by a constant, we can also write 

ff=-VxA (3.26) 

The electric field, in a source free region, can be derived from the mag- 
netic field by 

E=^—VxH (3.27) 

jwe 

Thus, if we know the vector potential, we can subsequently calculate E 
and H fields. 

Figure 3.13 shows an arbitrary current source I creating a distant 
vector potential A. 

The vector potential is related to the current source by 

n f p— AR 

A= f- I{r')——dV (3.28) 

An J R 

where I(r') = I x (r')x + I y (r')y + I z (r')z = current in three dimensions 
f = position vector in source coordinates 
r = position vector in field coordinates 
R = distance vector — r—r’ 

R = |i?| 

dl' = differential length at current source 



i 




Figure 3.13 Current source and 
distant vector potential. 
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The vector potential can be found from any line current source using 
Eq. (3.28). The results can be substituted into Eqs. (3.26) and (3.27) to 
find the distant fields. Two of the easier antenna problems to solve are 
the linear antenna and the loop antenna. 



3.7 Linear Antennas 

Fundamental to understanding antenna radiation is to understand the 
behavior of a straight wire or linear antenna. Not only is the math 
greatly simplified by a straight wire segment, but the linear antenna 
solution gives insight into the behavior of many more complicated 
structures which can often be viewed as a collection of straight wire 
segments. 



3.7.1 Infinitesimal dipole 

The infinitesimal dipole is a short wire segment antenna where the 
length L <£ X. It is aligned along the z-axis symmetrically placed about 
the x-y plane as shown in Fig. 3.14. 

The phasor current is given by I = I 0 z. The position and distance 
vectors are given by r = rf = xx + yy + zz, ?' = z'z, and R = xx + yy + 
(z — z')z. Thus, the vector potential is given by 



A= 



fL/2 e -jk^/x 2 +y 2 +(z-z ') 2 

J-l /2 ' \jx 2 + y 2 + (z — z') 2 



dz! 



(3.29) 



Since we are assuming an infinitesimal dipole (r z!), then R 
Thus, the integral can easily be solved yielding 



r. 



A — 



Mo 

An 




p-jkr 

Iq£ dz' = 

r 



HqIqL 
Anr e 



- Jkr z = Az 



(3.30) 



Since most antenna fields are more conveniently expressed in spheri- 
cal coordinates, we may apply a vector transformation to Eq. (3.30) or 
we may graphically determine A in spherical coordinates. Figure 3.15 
shows the vector potential in both rectangular and spherical 
coordinates. 




Figure 3.14 Infinitesimal dipole. 
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A z* 




Figure 3.15 Vector potential for 
an infinitesimal dipole. 



The A r and A„ components are vector projections of A z onto the cor- 
responding axes. It can thus be shown that 



jjLoIoLe jkr 

A r — A z cos 9 = ; cos ( 



A g = —A, sin 9 = — 



4nr 

lioIoLe 
4tz r 



-jkr 



■ sind 



(3.31) 

(3.32) 



Since H is related to the curl of A in Eq. (3.26), we can take the curl in 
spherical coordinates to get 



H* = 



jkI 0 L sin 0 
4 nr 



1 ' 

jkr 



-jkr 



(3.33) 



where H, = 0 and H g = 0. 

The electric field can be found from Eq. (3.33) by substituting into 
Eq. (3.27) giving 



E, = 



ijIoL cos 9 
2jrr 2 



1 ' 

jkr 



,-jkr 



(3.34) 



Eg = 



y'/uj/oL sin 0 

47 rr 





(3.35) 



where — 0 and // = intrinsic impedance of the medium. 

In the far-held, the higher order terms involving 1/r 2 and 1/r 3 become 
negligibly small simplifying Eqs. (3.34) and (3.35) to become 



Eg 

H 4 , 



jkiiI 0 L sin 9 _ jkr 

a e 

47rr 

jkI 0 Lsin9 _ jkr 

^ e 

4nr 



It should be noted that in the far held — ii. 



(3.36) 



(3.37) 
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Power density and radiation intensity. We can calculate the far field 
power density and the radiation intensity by substituting Eq. (3.36 ) into 
Eq. (3.8) and (3.12) to get 



W r (0,0) 



1 


ki]IoL sin 9 


2 


I 0 L 


2 sin 2 9 


2i] 


4;rr 


“ 8 


X 


r 2 



me) = 



>]_ 

8 



IoL 

X 



2 

sin 2 9 



(3.38) 

(3.39) 



where Iq — complex phasor current = /o|e 7? 

X — wavelength. 

Figure 3.16 shows a plot of the normalized radiation intensity given 
in Eq. (3.39). The radiation intensity is superimposed on a rectangular 
coordinate system. Since the antenna is aligned along the 2 -axis, the 
maximum radiation is broadside to the infinitesimal dipole. 



Directivity. The directivity, as defined in Eq. (3.16), can be applied to 
the infinitesimal dipole radiation intensity. The constant terms divide 
out in the numerator and denominator yielding 



D{0) = 



Ait sin 2 9 



Jo 11 Jo si 11 '* Qd9dtp 
This solution is exactly the same as derived in Example 3.3. 



= 1.5 sin 



(3.40) 



3.7.2 Finite length dipole 

The same procedure can also be applied to a finite length dipole to 
determine the far fields and the radiation pattern. However, since a 
finite length dipole can be viewed as the concatenation of numerous 
infinitesimal dipoles, we can use the principle of superposition to find 




Figure 3.16 3-D plot of infinitesi- 
mal dipole radiation. 
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Figure 3.17 Standing waves on a transmission line and on a dipole. 

the fields. Superposing numerous infinitesimal dipoles of length dz! 
results in an integral given as [1] 



Since dipoles are center fed and the currents must terminate at the 
ends, a good approximation for the dipole current is given as sinu- 
soidal (King [5]). It is well known that a twin lead transmission line 
with an open circuit termination generates sinusoidal standing waves 
along the conductors. If the ends of the leads are bent, so as to form a 
dipole, the currents can still be approximated as piece-wise sinusoidal. 
Figure 3.17a shows a twin lead transmission line with sinusoidal cur- 
rents. Figure 3.176 shows a twin lead transmission line with sinusoidal 
currents which is terminated into a dipole. The dipole currents can be 
viewed as an extension of the existing transmission line currents. 

Since sinusoidal currents are a good approximation to the currents on 
a linear antenna, we can devise an analytic expression for the current 
in Eq. (3.41) ([1,3]) as 



By substituting Eq. (3.42) into Eq. (3.41) we can solve for the approx- 
imate electric far field as 




(3.41) 




Io sin [A (§ — z')] 0<z'<L/2 

/o sin [A (| + z')] — L/2 < z' < 0 



(3.42) 




(3.43) 
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The magnetic field can easily be found to be 



j I 0 e jkr cos ( ^ cos 0 ) - cos ( ^ ) 
2itr sin0 



(3.44) 



Power density and radiation intensity. We can again calculate the far field 
power density and the radiation intensity to be 



W r (9, 0) 




V 


Io 


2 


COS ( cos 0) — cos(^) 


8 


nr 




sin0 



(3.45) 



me ) = 





h 


2 


COS (tt cos Q) — cos(^) 


8 


n 




sin0 



(3.46) 



Figure 3.18 shows plots of the normalized radiation intensity given in 
Eq. (3.46) in three dimensions for j = 0.5, 1, and 1.5. It can be seen that 
as the dipole length increases, the mainlobe narrows. However, in the 
L/k — 1.5 case, the mainlobe no longer is perpendicular to the dipole 
axis. Usually dipoles are designed to be of length L = kf 2. 



Directivity. The directivity of the finite length dipole is given by substi- 
tuting Eq. (3.46) into Eq. (3.16). 



471 



D(9) = 



COs(ir j COS^) — C0s(7T j) 



l2 



sin# 



p2tz r7T 

Jo Jo 



cos (tt j cosQ)—cos(tt j) 



sin# 



sin 6d6d<p 



(3.47) 



where L/k is the length in wavelengths. 

The maximum directivity is given when the numerator is at its max- 
imum value. We can plot the finite dipole maximum directivity (Do) vs. 
the length L in wavelengths to yield Fig. 3.19. 




(a) L/k = 0.5 (b)m=l 

Figure 3.18 Finite dipole radiation intensity. 



(c) Llk = 1 .5 





More Books Visit : www.iqbalkalmati.blogspot.com 



58 Chapter Three 




3.8 Loop Antennas 

In addition to solving for the fields radiated from linear antennas, it 
is also instructive to calculate fields radiated by loop antennas. Both 
linear and loop antennas form the backbone for numerous antenna 
problems and both forms of antennas shed light on general antenna 
behavior. 



3.8.1 Loop of constant phasor current 

Figure 3.20 shows a loop antenna of radius a, centered on the z axis and 
residing in the x-y plane. The loop current Iq flows in the <p' direction. 

We will assume the far field condition such that (a r). We can there- 

fore assume that r and R are approximately parallel to each other. 




Figure 3.20 Small loop antenna. 
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We can now more easily determine the vector R. We can define the 
following variables: 

r = r sin 6 cos (px + sin0 sin cpy + cos Oz (3.48) 

?' — a cos <p'x + a sin cp'y (3.49) 

R ss r — ?' ■ r — r — a sin0 cos(0 — </>') (3.50) 

The vector potential is given by the integral of Eq. (3.27) and is found 
to be 



Hr. f> JkR 

A=^ Iq4>' — ——ad(j)' (3.51) 

47T Jo R 

where (p 1 is the current direction in source coordinates. It can be conver- 
ted to field coordinates by the relation 

$' = cos((/> — <p')$ + sinicp — <p')p (3.52) 

We can safely approximate the denominator term in Eq. (3.51) as 
R r without appreciably changing the solution. The exponent term, 
however, must preserve the phase information and therefore we will use 
R exactly as defined by Eq. (3.50). Substituting Eq. (3.52) and Eq. (3.50) 
into Eq. (3.51) we get 

p— jk(r— asin#cos(</>— 0')) 

A = -p / /o<oosW-f)# + sm(0-i.')p) dill 

41 h T (3.53) 

Since the loop is symmetric, the same solution will result regardless 
of the (p value chosen. For simplification purposes, let us choose 0 = 0. 
Also, since the p term has odd symmetry, the p term integrates to zero. 
Thus, Eq. (3.53) thus simplifies to 

A — afl ° Io< £ e — f 2 * cos <p'e jkasiD0cos4, 'd<p' (3.54) 

An r Jo 

Equation (3.54) is solvable in closed form and the integral solution can 
be found in Gradshteyn and Ryzhik [6]. The vector potential solution is 
given as 



A — 



ja/iohJ) e jkr 
2 r 



Ji(kasinO) 



(3.55) 



where J\ is the Bessel function of the first kind and order 1. 
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The electric and magnetic field intensities can be found from Eq. (3.26) 
and Eq. (3.27) to be 

Ctkll Ic\ 

E,„ = — MkasinO) (3.56) 

2 r 

where E r ~ 0 and E g sa 0. 

H e = = _ a ^ e _^L Ji(tesin0) (3.57) 

r\ 2 r 

where H, and Hi, ~ 0. 

Power density and radiation intensity. The power density and radiation 
intensity can be easily found to be 

W r (0, 4>) = ^\E^\ 2 =^ ( 2 f“) 2 sin0) (3.58) 

/ o \ 2 

U(6,4>) = | 7o| 2 J/ffotsind) (3.59) 

The variable ka can also be written as 2jz . a — ( : where C is the circum- 
ference of the loop. Figure 3.21 shows plots of the normalized radiation 
intensity given in Eq. (3.59) for j — 0.5, 1.25, and 3. The radiation 
intensity is superimposed on a rectangular coordinate system. 



Directivity. The directivity of the constant phasor current loop is again 
given by Eq. (3.16). We can substitute Eq. (3.59) into Eq. (3.16) to yield 



D{6) = 



4jt J 2 ( ^asin 0) 

/ 0 2 X J 2 (ka sin 0) sin 6d9d(p 



24 2 (fsind) 

fo J l ( x s ^ n s ^ n 



(3.60) 



where C — 2jt a — loop circumference. 







(b) j = 1.25 



10 1- 3 



Figure 3.21 Loop radiation intensity. 
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The maximum directivity (Do) is given when the numerator is max- 
imum. The numerator is maximum when the Bessel function is maxi- 
mum. For C/X > 1.84 the Bessel function maximum is always 0.582. For 
C/X < 1.84 the maximum is given as 2 J^(C/X). We can plot the loop 
maximum directivity (Do) vs. the circumference C in wavelengths to 
yield Fig. 3.22. 
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Problems 

3.1 For an antenna in free space creating an electric field intensity given by 

— 4cos$ ,j, r - TT , 

E s = e~ Jkr 0 V/m 

r 

(a) What is the average power density? 

(b) What is the radiation intensity? 

(c) What is the total power radiated from the antenna? 

( d ) Use MATLAB to plot the normalized power pattern. 
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3.2 Use the help given in Example 3.4 and plot the following power patterns 
using MATLAB for —90° < 9 < 90° 

( a ) cos 4 (60 

( b ) sin 2 (d) 

(c) sin 4 (d) 

3.3 What is the 3-dB beamwidth for the patterns given in Prob. 3.2? 

3.4 For the following radiation intensities, use MATLAB to plot the elevation 
plane polar pattern when <fi = 0° and the azimuthal plane polar pattern when 

9 = 90° . What are the boresight directions for these two patterns? Plot for the 
first pattern for 0° < 9 < 180° and the second pattern for —90° < ij> < 90° 

(a) U(9, 4>) = sin 2 (d) cos 2 (0) 

(b) U(9, <j>) = sin 6 (0) cos 6 (<p) 

3.5 Calculate the maximum directivity for the following radiation intensities 

(а) U(9, <p) = 4cos 2 (60 

(б) U(9,<P) = 2sin 4 (d) 

(c) U(9, tp) = 2 sin 2 (0) cos 2 (0) 

(i d ) U{9,4>) = 6sin 2 (2d) 

3.6 Using Eq. (3.19) to find the beam solid angle for the following radiation 
intensities 

( a ) U(9, (p) = 2 cos 2 (60 

{b) U(9,4>) = 4sin 2 (£)cos 2 (0) 

(c) U(9, <j>) = 4cos 4 (d) 

3.7 For the figure shown below calculate the receive power P 2 at antenna 2 if 
the transmit power P\ = 5 kW. The transmitter gain is G\(Q\, <p\) = dsin 4 !^) 
and the receiver gain is G 2 (d 2 , fa) = 2 sin 2 (# 2 ). The operating frequency is 

10 GHz. Use the Fig. P3.1 to solve the problem. The y-z and y'-z' planes are 
coplanar. 



e 2 



Antenna 2 



Antenna 1 

z ^ 



r = 2 km 



h = 200 m 



Figure P3.1 Two antennas separated by a distance r. 
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3.8 Create three normalized plots in one polar plot figure using MATLAB for 
the finite length dipole where j = 0.5, 1, and 1.5. Normalize each plot before 
plotting. One can use the “hold on” command to overlay the plots in the same 
figure. 

3.9 What is the maximum directivity in dB for a finite length dipole whose 
length is j = 1.25. 

3.10 Create three normalized plots in one polar plot figure using MATLAB 
for the loop antenna where = 0.5, 1.25, and 3. Normalize each plot before 
plotting. One can use the “hold on” command to overlay the plots in the same 
figure. 

3.11 What is the maximum directivity in dB for the loop antenna whose cir- 
cumference in wavelengths ? = 10. 
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Chapter 

4 

Array Fundamentals 



Smart antennas are composed of a collection of two or more anten- 
nas working in concert to establish a unique radiation pattern for the 
electromagnetic environment at hand. The antenna elements are al- 
lowed to work in concert by means of array element phasing, which is 
accomplished with hardware or is performed digitally. In the previous 
chapter we considered individual antenna elements such as the dipole 
or loop. In this chapter we will look at generic collections of anten- 
nas. These collections could be composed of dipole or loop antennas but 
it is not necessary to restrict ourselves to any particular antenna ele- 
ments. As we will discover, the behavior of arrays transcends the specific 
elements used and the subject of arrays in and of itself has generated 
an extensive body of work. In fact the subject of arrays has merited 
entire textbooks devoted to the subject. Some very useful texts include 
Haykin [1], Johnson and Dudgeon [2], and Van Trees [3]. Extensive ma- 
terial can be found in the phased array text by Brookner [4] or in the 
article by Dudgeon [5] . 

Arrays of antennas can assume any geometric form. The various 
array geometries of common interest are linear arrays, circular arrays, 
planar arrays, and conformal arrays. We will begin with a discussion 
of linear arrays. A thorough treatment of linear arrays is found in the 
texts by Balanis [6] and Kraus and Marhefka [7] . 



4.1 Linear Arrays 

The simplest array geometry is the linear array. Thus, ah elements are 
aligned along a straight line and generally have a uniform interelement 
spacing. Linear arrays are the simplest to analyze and many valuable 
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insights can be gained by understanding their behavior. The minimum 
length linear array is the 2-element array 

4.1.1 Two element array 

The most fundamental and simplest array to analyze is the two-element 
array. The two-element array demonstrates the same general behav- 
ior as much larger arrays and is a good starting point in order to 
understand the phase relationship between adjacent array elements. 
Figure 4.1 shows two vertically polarized infinitesimal dipoles aligned 
along the y axis and separated by a distance d. The field point is located 
at a distance r from the origin such that r d. We can therefore assume 

that the distance vectors r\, r, and r 2 are all approximately parallel to 
each other. 

We can therefore make the following approximations: 



d . n 
n ^ r + 2 sin 0 


(4.1) 


d . ^ 

r 2 ^ r sin 6 

2 


(4.2) 



Let us additionally assume that the electrical phase of element 1 is 
— 5/2 such that the phasor current in element 1 is Ioe~^. The electrical 
phase of element 2 is +5/2 such that the phasor current in element 2 is 
/(,e y 2 . We can now find the distant electric field by using superposition 
as applied to these two dipole elements. Using Eq. (3.36) and Eqs. (4.1) 
and (4.2), and assuming that n « r\ « r in the denominator, we can 
now find the total electric field. 



Ea = 



jkr]I 0 e- j z L sin f 
47rri 

jhiloL sin 0 



-jkn 



jkiiloe-i 2 Lsind 
477-^2 



jkr 2 



-jkr 



4 nr 



■ ( kd sin 0+5) 

y J 2 



• (kd sin 0+5) 
O-J 2 



(4.3) 




-da 



da 



Figure 4.1 Two infinitesimal 
dipoles. 



0 
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where 8 = electrical phase difference between the two adjacent 
elements 
L — dipole length 

6 — angle as measured from the 2 axis in spherical coordinates 
d = element spacing 



We can further simplify Eq. (4.3) such that 

/ (kdsinO + 8 ) 



jkijI 0 Le Jkr . 

Kg — sin 0 ■ I 2 cos 

4jt r 



V 



Element factor 



Array factor 



(4.4) 



where the element factor is the far field equation for one dipole and the 
array factor is the pattern function associated with the array geometry. 

The distant field from an array of identical elements can always be 
broken down into the product of the element factor (EF) and the array 
factor (AF). The very fact that the antenna pattern can be multiplied by 
the array factor pattern demonstrates a property called pattern multi- 
plication. Thus, the far field pattern of any array of antennas is always 
given by (EF) x (AF). The AF is dependent on the geometric arrange- 
ment of the array elements, the spacing of the elements, and the elec- 
trical phase of each element. 

The normalized radiation intensity can be found by substituting 
Eq. (4.4) into Eq. (3.13) to get 



U n (0) = [sin0] 2 
= [sine] 2 



cos 



(Msine + 8) 



1 2 



(4.5) 



( nd . 8 

cos — sm e + - 
\ X 2 



We can demonstrate pattern multiplication by plotting Eq. (4.5) for 
the case where d/X = .5 and 8 — 0. This is shown in Fig. 4.2. Figure 4.2a 
shows the power pattern for the dipole element alone. Figure 4.2 b shows 
the array factor power pattern alone. And Fig. 4.2c shows the multipli- 
cation of the two patterns. 

The overriding principle demonstrated by the 2-element array is that 
we can separate the element factor from the array factor. The array fac- 
tor can be calculated for any array regardless of the individual elements 
chosen as long as all elements are the same. Thus, it is easier to first 
analyze arrays of isotropic elements. When the general array design 
is complete, one can implement the design by inserting the specific 
antenna elements required. Those antenna elements can include, but 
are not restricted to, dipoles, loops, horns, waveguide apertures, and 
patch antennas. A more exact representation of array radiation must 
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o 



o 




180 

(a) 



180 

(b) 



0 




(C) 

Figure 4.2 (a) Dipole pattern, (6) Array factor pattern, (c) Total pattern. 



always include the effects of coupling between adjacent antenna ele- 
ments. However, that topic is beyond the scope of this book and will be 
left for treatment by more advanced texts. The reader may refer to the 
text by Balanis [6] for more information on element coupling. 

4.1.2 Uniform W-element linear array 

The more general linear array is the JV-element array. For simplification 
purposes, we will assume that all elements are equally spaced and have 
equal amplitudes. Later we may allow the antenna elements to have 
any arbitrary amplitude. Figure 4.3 shows an JV-element linear array 
composed of isotropic radiating antenna elements. It is assumed that 
the nth element leads the ( n — 1) element by an electrical phase shift of 
S radians. This phase shift can easily be implemented by shifting the 
phase of the antenna current for each element. 
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Assuming far field conditions such that r ~S> d, we can derive the 
array factor as follows: 



_j_ gj(kdsin6+8) _|_ ^j2(kd sin 0+8) gj(N—l)(kdsinO+8) 



(4.6) 



where S is the phase shift from element to element. 
This series can more concisely be expressed by 



N 



N 



AF = V gj(n—l)(kd sin 0+8) V e j(n-Df 



n= 1 



n= 1 



(4.7) 



where ir — kd sin 0 + 6. 

It should be noted that if the array is aligned along the z-axis then 

= kd cosd + S. 

Since each isotropic element has unity amplitude, the entire behav- 
ior of this array is dictated by the phase relationship between the 
elements. The phase is directly proportional to the element spacing 
in wavelengths. 

The array processing and array beamforming textbooks have taken 
an alternative approach to expressing Eq. (4.7). Let us begin by defining 
the array vector. 



gj(kdsinO+8) l)(Msmfl+5)"| ^ 



(4.8) 

where [ ] T signifies the transpose of the vector within the brackets. 

The vector a(O') is a Vandermonde vector because it is in the form 
[1 z ... z (iV_1) ]. In the literature the array vector has been alternatively 
called: the array steering vector [2], the array propagation vector [8, 9], 
the array response vector [10], and the array manifold vector [3]. For 
simplicity’s sake, we will call a(J)) the array vector. Therefore, the array 



pj(kd sin 0+8) 



a(6) = 



pjiN—VikdsinO+S) 
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factor, in Eq. (4.7), can alternatively be expressed as the sum of the 
elements of the array vector. 

AF = sum(a(0)) (4.9) 



The utility of the vector notation in Eq. (4.8) will be more readily 
seen in Chaps. 7 and 8 when we study angle-of-arrival estimation and 
smart antennas. In our current development, it is sufficient to use the 
notation of Eq. (4.7). 

We may simplify the expression in Eq. (4.6) by multiplying both sides 
by e r ' ! ' such that 

e^AF = e j * + e j2 * + • • • + e JN * (4.10) 



Subtracting Eq. (4.6) from Eq. (4.10) yields 

0 e J * - 1)AF = (e jm - 1) 

The array factor can now be rewritten. 

( e iW _ i) e J^( e j^ - e ~j %*) 

AF = = a 

(ei* — 1) e 4( e 4- e -4) 

_ .^^ sin(fyr) 
sin(f) 



(4.11) 



(4.12) 



The term accounts for the fact that the physical center of the 

array is located at (N— l)d/2. This array center produces a phase shift 
of (N- 1)^/2 in the array factor. If the array is centered about the 
origin, the physical center is at 0 and Eq. (4.12) can be simplified to 
become 



AF = 8in (f *) 
sm(f) 



(4.13) 



The maximum value of AF is when the argument xj/ — 0. In that case 
AF = N. This is intuitively obvious since an array of N elements should 
have a gain of N over a single element. We may normalize the AF to be 
reexpressed as 



AF„ 



1 sin^i/O 

N sin (f) 



(4.14) 



In the cases where the argument x[r/2 is very small, we can invoke 
the small argument approximation for the sin (t/t/ 2) term to yield an 
approximation 



AF„ 



sin(f f) 

Nj, 



(4.15) 
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It should be noted that the array factor of Eq. (4.15) takes the form of 
a sin(x)/x function. This is because the uniform array itself presents a 
finite sampled rectangular window through which to radiate or receive 
a signal. The spatial Fourier transform of a rectangular window will 
yield a sin(x)/x function. The Fourier transform relationship between 
an antenna array and its radiation pattern is explained in [6] . 

Let us now determine the array factor nulls, maxima, and the main- 
lobe beamwidth. 

Nulls From Eq. (4.15), the array nulls occur when the numerator 
argument Ah/r/2 = ±mr. Thus, the array nulls are given when 

N„ , . 

—\ka sin d nu n + 8) — ±Ji7t 
z 

or 

0nuii = S in _1 (^ _<5 )) «= 1,2,3... (4.16) 

Since the sin(0 nu n) < 1, for real angles, the argument in Eq. (4.16) must 
be 1. Thus, only a finite set of it values w ill satisfy the equality. 

Example 4.1 Find all of the nulls for an 1V= 4 element array with d = .5/. 
and 5 = 0. 



Solution Substituting N, d, and 5 into Eq. (4.16) 

0null = sin_1 ( ± ^) 

Thus, 0 nuU = ±30°, ±90° 



Maxima The mainlobe maximum in 
denominator term \jr/2 — 0. Thus 



Eq. (4.15) occurs when the 




(4.17) 



The sidelobe maxima occur approximately when the numerator is a 
maximum. This occurs when the numerator argument A5///2 = ±(2n+l ) 
7r/2. Thus 



9 S = sin 




/ (2n+ 1 )jt 

V N~ 




= ± 



7 r 
2 



+ cos 1 




/ (2 71+ 1)7 r 

V N~' 




(4.18) 



Example 4.2 Find the mainlobe maximum and the sidelobe maxima for the 
case where N = 4, d = ,5X, and 5 = 0. 
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Solution Using Eq. (4.17) the mainlobe maximum can be found to be 0 max = 0 
or 7T. n is a valid solution because the array factor is symmetric about the 
6 = n / 2 plane. The sidelobe maxima can be found from Eq. (4.18) to be 

e s = ±48.59°, ±131.4° 



Beamwidth The beamwidth of a linear array is determined by the 
angular distance between the half-power points of the mainlobe. The 
mainlobe maximum is given by Eq. (4.17). Figure 4.4 depicts a typical 
normalized array radiation pattern with beamwidth as indicated. 

The two half power points (0+ and 0_) are found when the normalized 
AF„ = .707 (AF^ = .5). If we use the array approximation given in 
Eq. (4.15) we can simplify the calculation of the beamwidth. 

It is well known that sin(x)/% = .707 when x — ±1.391. Thus, the 
normalized array factor is at the half-power points when 

^(kd sin 0 ± + <5) = ±1.391 (4.19) 

£ 

Rearranging to solve for 6± we get 



e± 





(4.20) 




Figure 4.4 Half-power beamwidth of a linear array. 
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The half-power beamwidth is now easily shown to be 



HPBW = \9+ — 9-\ (4.21) 

For large arrays, the beamwidth is narrow enough such that the HPBW 
can be approximated as 



HPBW = 2|0 + — d max | = 2|0 max — d_| (4.22) 



dmax is given by Eq. (4.17) and 9± is given by Eq. (4.20). 

Example 4.3 For the 4-element linear array with 5 = —2.22 and d = .5k, 
what is the half-power beamwidth? 

Solution 6> max is first found using Eq. (4.17). Thus, 9 max — 45°. 9 + is found 
using Eq. (4.20). Thus, 9 + = 68.13°. The half-power beamwidth is then ap- 
proximated by using Eq. (4.21) to be HPBW = 46.26°. 

Broadside linear array. The most common mode of operation for a linear 
array is in the broadside mode. This is the case where <5 = 0 such that 
all element currents are in phase. Figure 4.5 shows three polar plots 
for a 4-element array for element distances d/k — .25, .5, and .75. 

This array is called a broadside array because the maximum radi- 
ation is broadside to the array geometry. Two major lobes are seen 
because the broadside array is symmetric about the 0 — ±tt/2 line. 
As the array element spacing increases, the array physically is longer, 
thereby decreasing the mainlobe width. The general rule for array ra- 
diation is that the mainlobe width is inversely proportional to the array 
length. 

End-fire linear array. The name end-fire indicates that this array’s max- 
imum radiation is along the axis containing the array elements. Thus, 
maximum radiation is “out the end” of the array. This case is achieved 
when S — —kd. Figure 4.6 shows three polar plots for the end-fire 
4-element array for the distances d/k — .25, .5, and .75. 

It should be noted that the mainlobe width for the ordinary end-fire 
case is much greater than the mainlobe width for the broadside case. 
Thus, ordinary end-fire arrays do not afford the same beamwidth effi- 
ciency as the broadside array. The beamwidth efficiency in this context 
is the beamwidth available relative to the overall array length. 

An increased directivity end-fire array has been developed by Hansen- 
Woodyard [11] where the phase shift is modified such that 8=—(kd + j.). 
This provides a dramatic improvement in the beamwidth and directiv- 
ity. Since the Hansen-Woodyard 1938 paper is not generally accessible, 
a detailed derivation can be found in [5] . 
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o o 





d = 0.254 



d = 0.54 



o 




d = 0.754 



Figure 4.5 The 4-element broadside array with 5 = 0 and d. = .254, .54, and .754. 



Beamsteered linear array. A beamsteered linear array is an array where 
the phase shift <5 is a variable thus allowing the mainlobe to be di- 
rected toward any direction of interest. The broadside and end-fire con- 
ditions are special cases of the more generalized beamsteered array. 
The beamsteering conditions can be satisfied by defining the phase shift 
S = —kdsinOo- We may rewrite the array factor in terms of beamsteer- 
ing such that 



1 sin(^(sin0 - sin do)) 
N sin(^(sin0 - sin do)) 



(4.23) 



Figure 4.7 shows polar plots for the beamsteered 8-element array for 
the d/X — .5, and 9 0 = 20, 40, and 60°. Major lobes exist above and 
below the horizontal because of array is symmetry. 
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o o 





d = 0.254 d = 0.54 



o 




d = 0.754 

Figure 4.6 The 4-element end-fire array with S = —kd and d = .254, .54, and .754. 

The beamwidth of the beamsteered array can be determined by using 
Eqs. (4.20) and (4.21) such that 

/ 2 782 \ 

8 ± =sm-^±— + sm»„J (4.241 

where S = —kdsinOo. 

60 = steering angle. 

The beamsteered array beamwidth is now given as 

HPBW = \Q + — 9-\ (4.25) 

For the N = 6-element array, where do = 45°, 9+ = 58.73°, and 0- = 
34.02°. Thus the beamwidth can be calculated to be HPBW = 24.71°. 
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o o 





e 0 = 20° 0 O = 40° 



0 




e 0 = 60 ° 



Figure 4.7 Beamsteered linear array with 9 q = 45°. 



4.1.3 Uniform W-element linear 
array directivity 

Antenna directivity was previously defined in Eq. (3.16). Directivity 
is a measure of the antennas ability to preferentially direct energy in 
certain directions. The directivity equation is repeated as follows: 



DO, <p) 



AttUO, <p) 

/f Jo UO, <p) sin ddOcL(j) 



(4.26) 



Our previous derivation for the array factor assumed the array was 
aligned along the horizontal axis. This derivation helped us to visu- 
alize the array performance relative to a broadside reference angle. 
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Figure 4.8 The JV-element linear 
array along z-axis. 



However, the horizontal array does not fit symmetrically into spherical 
coordinates. In order to simplify the calculation of the directivity, let us 
align the linear array along the z-axis as shown in Fig. 4.8. 

Since we have now rotated the array by 90° to make it vertical, we 
can modify the AF by allowing i> — kd cos 0 + S. Now the broadside 
angle is when 9 — 90°. Since the array factor is proportional to the 
signal level and not the power, we must square the array factor to yield 
the array radiation intensity U(9). We now substitute the normalized 
approximate (AF„) 2 into Eq. (4.26). 



(4.27) 

in 9 d9 d(p 

The maximum value of the normalized array factor is unity. Thus the 
maximum directivity is given as 



4 TT 



D(9) = 



' sin)^ (M cos O+fi) 
i (kd cos O+k) 



/ • 27 T rn ( £ 

o Jo \ v “ 



n(^(Mcos0+<$)) \ 4 
^(kd cos 6+8) J 



Dn = 



4tt 



r2ii fit /sin(f (Mcos0+<5)) \ 2 . nJnJj. 

fo fo ( "adcoso+6) ) sm 9d9di> 



(4.28) 



Solving for the maximum directivity is now simply the matter of solv- 
ing the denominator integral although the integral itself is not trivial. 



Broadside array maximum directivity. As was noted before, the case of 
broadside maximum directivity requires that 5 = 0. We can simplify the 
directivity equation by integrating over the c/j variable. Thus, Eq. (4.28) 
can be simplified to be 



rit f sin(fMcos0)\ 2 • „ 

jo {yrgh/coso ) smuau 



(4.29) 
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We can define the variable x — ^kd cos 6. Then dx = — ^kd sin 0 dO. 
Substituting the new variable x into Eq. (4.29) yields 



A> = 



Nkd 



r-Nkd/2 

J-Nkd/2 



^ sin(x) ^ 



2 



dx 



(4.30) 



As Nkd/2 jt, the limits can be extended to infinity without a sig- 
nificant loss in accuracy. The integral solution can be found in integral 
tables. Thus 

D 0 « 2N- (4.31) 

X. 



End-fire array maximum directivity. End-fire radiation conditions are 
achieved when the electrical phase between elements is 8 = —kd. This 
is equivalent to writing the overall phase term as i/ / = kdicosO — 1). 
Rewriting the maximum directivity, we get 



A> = 



2 



rn / sin((jM(,cos0— 1)) \ ^ 

JO ^ !± kd(cos8-l) J 



sin OdO 



(4.32) 



We may again make a change of variable such that x = ^kdicosd- 1). 
Then dx — — ^kd sin Odd. Upon substitution of x into Eq. (4.32) we have 



Do — 



Nkd 




(4.33) 



As Nkd/2 » tv, the upper limit can be extended to infinity without a 
significant loss in accuracy. Thus 



Do 




(4.34) 



The end-fire array has twice the directivity of the broadside array. 
This is true because the end-fire array only has one major lobe, while 
the broadside array has two symmetric major lobes. 



Beamsteered array maximum directivity. The most general case for the 
array directivity is found by defining the element -to-element phase 
shift S in terms of the steering angle do- Let us rewrite Eq. (4.27) by 
substituting S = —kdcosOo- 



DW,6 0 ) 



4t r 



( sin(y (kdicosO— cos#o))) 
(kd( cos S — cos ) ) 



2 



r2n rn / si n( 0 (kdi cos 9 —cos 9q ) )) 

JO JO ^ j(kd(cos6-cos6o» 



2 

sin 6 d6 d<f> 



(4.35) 
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Figure 4.9 Family of steered array maximum directivity curves. 



It is instructive to plot the linear array directivity vs. angle for a few 
different steering angles using MATLAB. We would expect the end- 
fire and broadside maximum directivities, calculated previously, to be 
two points on the more general plot of Eq. (4.35). Allowing N — 4 and 
d = .5k, the directivity is shown plotted in Fig. 4.9. The maximum val- 
ues are slightly higher than predicted by Eqs. (4.31) and (4.34) because 
no approximations were made to simplify the integration limits. 



4.2 Array Weighting 

The previous derivation for the array factor assumed that all of the 
isotropic elements had unity amplitude. Because of this assumption, 
the AF could be reduced to a simple series and a simple sin(x)/x 
approximation. 

It was apparent from Figs. 4.5 and 4.6 that the array factor has 
sidelobes. For a uniformly weighted linear array, the largest sidelobes 
are down approximately 24 percent from the peak value. The pres- 
ence of sidelobes means that the array is radiating energy in untended 
directions. Additionally, due to reciprocity, the array is receiving energy 
from unintended directions. In a multipath environment, the sidelobes 
can receive the same signal from multiple angles. This is the basis for 
fading experienced in communications. If the direct transmission angle 
is known, it is best to steer the beam toward the desired direction and to 
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shape the sidelobes to suppress unwanted signals. The sidelobes can be 
suppressed by weighting, shading, or windowing the array elements. 
These terms are taken from the EM, underwater acoustics, and array 
signal processing communities, respectively. Array element weighting 
has numerous applications in areas such as digital signal processing 
(DSP), radio astronomy, radar, sonar, and communications. Two excel- 
lent foundational articles on array weighting are written by Harris [12] 
and Nuttall [13]. 

Figure 4.10 shows a symmetric linear array with an even number 
of elements N. The array is symmetrically weighted with weights as 
indicated: 

The array factor is found by summing the weighted outputs of each 
element such that 

AFeven = W M e + . . . + ^-jikdmnB 

+ U >1 e J * kdBin0 + ■ ■ • + w M e j ^ kdsine (4.36) 

where 2M — N — total number of array elements. Each apposing pair 
of exponential terms in Eq. (4.36) forms complex conjugates. We can 
invoke the Euler’s identity for the cosine to recast the even array factor 
given as follows: 

AFeven — 2 ^ w n cos ( — ^ kd sin 6 j (4.37) 

n=\ ' ' 

Without loss of generality, the 2 can be eliminated from the expression 
in Eq. (4.37) to produce a quasi-normalization. 

M 

AFeven = ^ W n C0S((2?1 - 1 )u) (4.38) 

n= 1 



where u = ^ sin 0. 

The array factor is maximum when the argument is zero, implying 
0 = 0. The maximum is then the sum of all of the array weights. Thus, 
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Figure 4.1 1 Odd array with weights. 



we may completely normalize AF even to be 



Y^Ll w n cos((2n -l)w) 
\~~\M 

z2n=l w n 



AF — 

even — 



(4.39) 



It is easiest to express the array factor in the form of Eq. (4.33). 
However, for plotting purposes, it is best to use the normalized array 
factor given in Eq. (4.39). 

An odd array is depicted in Fig. 4.11 with the center element at the 
origin. 

We may again sum all of the exponential contributions from each 
array element to get the quasi-normalized odd array factor. 



M + 1 

AF odd = ^ w n cos(2(n — 1 )m) (4.40) 

n=l 



where 2 M + 1 — N. 

In order to normalize Eq. (4.40) we must again divide by the sum of 
the array weights to get 



AF odd — 



Y ^=\ w n cos(2(m - 1)m) 



2^n= 1 



W n 



(4.41) 



We may alternatively express Eqs. (4.38) and (4.40) using the array 
vector nomenclature previously addressed in Eq. (4.8). Then, the array 
factor can be expressed in vector terms as 



AF = w T ■ aid) 



(4.42) 



where a{9) = array vector 

w T = [wm wm- i • ■ ■ w i ■ ■ ■ wm- l wm\ 

The weights w n can be chosen to meet any specific criteria. Generally 
the criterion is to minimize the sidelobes or possibly to place nulls at 
certain angles. However, symmetric scalar weights can only be utilized 
to shape sidelobes. 
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TABLE 4.1 Pascal’s Triangle 
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There are a vast number of possible window functions available that 
can provide weights for use with linear arrays. Some of the more com- 
mon window functions, along with corresponding plots, are explained 
next. Unless otherwise noted, we will assume that the array plotted 
has N = 8 weighted isotropic elements. 

Binomial Binomial weights will create an array factor with no side- 
lobes, provided that the element spacing d si- The binomial weights 
are chosen from the rows of Pascal’s triangle. The first nine rows are 
shown in Table 4.1. 

If we choose an N = 8-element array, then the array weights are taken 
from row 8 to be w\ — 35, w% = 21, = 7, and w\ = 1. The normalized 

binomial weights are u>\ = l,u >2 = = .2,andu;4 = .0286. The eight 

array weights can more conveniently be found using the MATLAB com- 
mand diag(rot90(pascal(N))). The normalized array weights are shown 




Figure 4.12 Binomial array weights. 
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Figure 4.1 3 Array factor with binomial weights. 



plotted in Fig. 4.12 using the stem command. The weighted array factor 
is superimposed over the unweighted array factor as shown in Fig. 4.13. 
The price paid for suppressing the sidelobes is seen in the broadening 
of the main beamwidth. 

Blackman The Blackman weights are defined by 

w(k + 1) = .42 — .5cos(27r£/(lV — 1)) 

+ .08cos(47t&/(1V— l))k = 0, 1, . . . , N- 1 (4.43) 

For the N = 8 element array, the normalized Blackman weights are 
w\ = 1, W 2 = .4989, W 3 = .0983, and uo\ — 0. The eight array weights 
can be found using the Blackman(N) command in MATLAB. The nor- 
malized array weights are shown plotted in Fig. 4.14 using the stem 
command. The weighted array factor is shown in Fig. 4.15. 

Hamming The Flamming weights are given by 

w(k + 1) = .54 — .46 cos[2jtk/(N — 1)] k = 0, 1, . . . , N— 1 (4.44) 

The normalized Flamming weights are w i = 1, u >2 = .673, W 3 = .2653, 
and Wi — .0838. The eight array weights can be found using the 
hamming(N) command in MATLAB. The normalized array weights are 
shown plotted in Fig. 4.16 using the stem command. The weighted array 
factor is shown in Fig. 4.17. 
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Figure 4.14 Blackman array weights. 



Gaussian The gaussian weights are determined by the gaussian func- 
tion to be 

1 ( k—N/2 \ 2 

W{k + 1) = e 2 ' k = 0, 1, . . . , N a > 2 (4.45) 

The normalized Gaussian weights for a = 2.5 are u>\ = 1, n> 2 = .6766, 
ws = .3098, and w\ — .0960. The eight array weights can be found using 
the gausswin(N) command in MATLAB. The normalized array weights 




Figure 4.15 Array factor with Blackman weights. 
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Figure 4.1 6 Hamming array weights. 



are shown plotted in Fig. 4.18 using the stem command. The weighted 
array factor is shown in Fig. 4.19. 

Kaiser-Bessel The Kaiser-Bessel weights are determined by 



w(k ) = 




Io bra] 



k = 0,l,...,N/2 a > 1 (4.46) 




Figure 4.17 Array factor with Hamming weights. 



More Books Visit : www.iqbalkalmati.blogspot.com 



86 Chapter Four 




Figure 4.18 Gaussian array weights. 



The normalized Kaiser-Bessel weights for a = 3 are w\ = 1,W2= .8136, 
wa = .5137, and w\ = .210. The eight array weights can be found using 
the kaiser(N, a) command in MATLAB. The normalized array weights 
are shown plotted in Fig. 4.20 using the stem command. The weighted 
array factor is shown in Fig. 4.21. 

It should be noted that the Kaiser-Bessel weights provide one of the 
lowest array sidelobe levels while still maintaining nearly the same 




Figure 4.19 Array factor with gaussian weights. 
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Figure 4.20 Kaiser-Bessel array weights. 



beamwidth as uniform weights. Additionally, the Kaiser-Bessel weights 
for a = 1 yield the uniform set of weights. 

Other potential weight functions are the Blackman-Harris, 
Bohman, Hanning, Bartlett, Dolph-Chebyshev, and Nuttall. A detailed 
description of these functions can be found in [12, 13] . In addition, many 
of these weight functions are available in MATLAB. 




Figure 4.21 Array factor with Kaiser-Bessel weights. 
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4.2.1 Beamsteered and weighted arrays 

Previously, in Sec. 4.1.3 we discussed the beamsteered uniformly 
weighted array. We could steer the mainlobe to any desired direction 
but we still experienced the problem of relatively large minor lobes. The 
nonuniformly weighted array can also be modified in order to steer the 
beam to any direction desired and with suppressed sidelobe levels. 

We can repeat Eqs. (4.38) and (4.40) but we can modify them to include 
beamsteering. 



M 

AFeven = ^ W n C0S((2?l - 1 )u) 

n= 1 
M + 1 

AFodd = ^2 w n cos(2(n — Du) 

n= 1 

where 

nd, . . . 

u = — (sin 9 — sin 9o) 

A 

As an example, we can use Kaiser-Bessel weights and steer the main- 
lobe to three separate angles. Let N = 8, d = A/2, a = 3, w-i = 1, 
W 2 — .8136, W 3 = .5137, and W 4 = .210. The beamsteered array factor 
for the weighted even element array is shown in Fig. 4.22. 

In general, any array can be steered to any direction by either using 
phase shifters in the hardware or by digitally phase shifting the data at 



(4.47) 

(4.48) 




Figure 4.22 Beamsteered Kaiser-Bessel weighted array. 
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the back end of the receiver. If the received signal is digitized and pro- 
cessed, this signal processing is often called digital beamforming (DBF) 
[8] . Current technologies are making it more feasible to perform DBF 
and therefore allow the array designer to bypass the need for hardware 
phase shifters. The DBF performed can be used to steer the antenna 
beam according to any criteria specified by the user. 



4.3 Circular Arrays 

Linear arrays are very useful and instructive but there are occasions 
where a linear array is not appropriate for the building, structure, or 
vehicle upon which it is mounted. Other array geometries may be nec- 
essary to appropriately fit into a given scenario. Such additional arrays 
can include the circular array. Just as the linear array was used for 
increased gain and beamsteering, the circular array can also be used. 
Figure 4.23 shows a circular array of N elements in the x-y plane. The 
array has N elements and the array radius is a. 

The ?ith array element is located at the radius a with the phase angle 
cp n . Additionally, each element can have an associated weight w n and 
phase <5„. As before, with the linear array, we will assume far- field con- 
ditions and will assume that the observation point is such that the 
position vectors r and r„ are parallel. We can now define the unit vector 
in the direction of each array element n. 

p n = cos <j) n x + sin <p n y (4.49) 

We can also define the unit vector in the direction of the field point. 

r = sin 6 cos <fx + sin 0 sin (py + cos Oz (4.50) 




Figure 4.23 Circular array of JV-elements. 
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It can be shown that the distance r n is less than the distance r by the 
scalar projection of p n onto r. (This is indicated by the dotted line in 
Fig. 4.23). Thus, 



r n = r -apn-r 



(4.51) 



with 



p n -r = sin 9 cos (p cos <p n + sin 9 sin cp sin cp n = sin 9 cos (cp — <p n ) 



The array factor can now be found in a similar fashion as was done with 
the linear array. With some effort, it can be shown that 

N N 

AF = Y w n e- j(hap ' f+Sn) = Y w n e- j[hasiaecoM - (,, ’' )+Ki (4.52) 

72=1 72=1 



where 

27T 

(p n = — (n — 1) = angular location of each element. 

4.3.1 Beamsteered circular arrays 

The beamsteering of circular arrays is identical in form to the beam- 
steering of linear arrays. If we beamsteer the circular array to the angles 



Circular array of N - 10 elements, a = 1 X 




Figure 4.24 AF elevation pattern for beamsteered circular array 
(So = 30°, 4> 0 = 0°). 
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Figure 4.25 3-D AF pattern for 
beamsteered circular array Wo = 
30°, <t > o = 0°). 



(0o> 0o ) we can determine that the element to element phase angle is 
S n = —ka sin 9 0 cos (</>o — 0„). We can thus rewrite the array factor as 



The circular array AF can be plotted in two or three dimensions. Let 
us assume that all weights are uniform and that the array is steered to 
the angles do = 30° and 0o = 0°. With N = 10 and a = X, we can plot 
the elevation pattern in the 0 = 0° plane as shown in Fig. 4.24. 

We can also plot the array factor in three dimensions as a mesh plot. 
Using the same parameters as above, we see the beamsteered circular 
array pattern in Fig. 4.25. 

4.4 Rectangular Planar Arrays 

Having explored the linear and circular arrays, we can move on to 
slightly more complex geometries by deriving the pattern for rectan- 
gular planar arrays. The following development is similar to that found 
in both Balanis [6] and in Johnson and Jasik [14]. 

Figure 4.26 shows a rectangular array in the x-y plane. There are M 
elements in the x -direction and N elements in the y-direction creating 
anMxJV array of elements. The m-nth element has weight w mn . The 
x -directed elements are spaced cL x apart and the y-directed elements 
are spaced d y apart. The planar array can be viewed as M linear arrays 
of N elements or as N linear arrays of M elements. Since we already 



N 



AF = ^2 WnC 



— j{&a[sin0 cos(0— 0 n )— sin0o cos(0o— 



(4.53) 
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M 



y 



Figure 4.26 N x M rectangular planar array. 



know the array factor for an M or N element array acting alone, we 
can use pattern multiplication to find the pattern of the entire M x N 
element array. Using pattern multiplication we have 



The weights a m and b„ can be uniform or can be in any form according 
to the designer’s needs. This could include the various weights discussed 
in Sec. 4.2 such as the binomial, Kaiser-Bessel, Hamming, or Gaussian 
weights. The a m weights do not have to be identical to the b„ weights. 
Thus, we might choose the Om weights to be binomial weights while the 
b n weights are Gaussian. Any combination of weighting can be used and 
w mn is merely the consequence of the product a m ■ b n . 

If beamsteering is desired then the phase delays fi x and by are given 



M 



N 

sin # cos 0+/3 x ) gj(n—l)(kd y sin# sin0+/8 y ) 

71=1 



AF = AF* • AF V = ^ a m e j(m - 1)(kd: 



771=1 

M N 






,j[(m—l)(kd x sin# cos </>+&; )+( ti— l)(kd y sin# sin <f>+p y )\ 

(4.54) 




by 



fix = —kd x sin 6 0 cos <po by = —kd y sin do sin </> o (4.55) 



Example 4.4 Design and plot a 8 x 8 element array with equal element spac- 
ing such that d x = d y = .51. Let the array be beamsteered to 9q = 45° and 
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Figure 4.27 Beamsteered planar 
array pattern. 



00 = 45° . The element weights are chosen to be the Kaiser-Bessel weights 
given in Sec. 4.2. Plot the pattern for the range 0 < 9 < n/2 and 0 < 0 < 2n. 

Solution Taking the weights from Sec. 4.2 we have ai = Gq = b\ = 64 = .2352, 
and a 2 = 03 = t>2 = ^3 = 1- Thus, w\ n = w m 1 = .0055 and all other w m nS = 1. 
We can substitute the weights into Eq. (4.51) to calculate the array factor. 
MATLAB can be used to plot Fig. 4.27. 



4.5 Fixed Beam Arrays 

Fixed beam arrays are designed such that the array pattern consists 
of several simultaneous spot beams transmitting in fixed angular di- 
rections. Normally these directions are in equal angular increments 
so as to insure a relatively uniform coverage of a region in space. 
However, this is not a necessary restriction. These fixed beams can 
be used in satellite communications to create spot beams toward fixed 
earth-based locations. As an example, the Iridium 1 (low earth orbit) 
satellite constellation system has 48 spot beams per satellite. Spot 
beams are sometimes also called pin-cushion beams because of the sim- 
ilarity to pins in a pin cushion. Figure 4.28 shows an example of a planar 
array creating three spot beams. 

Fixed beams can also be used for mobile communication base sta- 
tions in order to provide space division multiple access (SDMA) capa- 
bilities. Several have treated the subject of fixed beam systems such as 
Mailloux [15], Hansen [16], and Pattan [17]. 



1 Iridium Satellite LLC 
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0! = 30° 



45° 

60° 



Figure 4.28 Three spot beams created by a 16 x 16 
planar array. 



4.5.1 Butler matrices 

One method for easily creating fixed beams is through the use of Butler 
matrices. Details of the derivation can be found in Butler and Lowe [18] , 
and Shelton and Kelleher [19]. The Butler matrix is an analog means of 
producing several simultaneous fixed beams through the use of phase 
shifters. As an example, let us assume a linear array of N elements. If 
N — 2 n elements, the array factor can be given as 



AF(<9) = 



sin ( Nit f sin 6 — p t ) 
Ntt f sin 9 — Pi 



sin [Abril sin 0 — sindO] 
Abr|(sin0 — sin0 £ ) 



with 



sin0£ 



a 

Nd 



Pi = h r 



= ±i,±- 

2 2 2 



(4.56) 



The l values create evenly spaced contiguous beams about 9 = 0°. If the 
element spacing is d = 1/2, the beams are evenly distributed over the 
span of 180°. If d > 1/2, the beams span an ever decreasing range of 
angles. Grating lobes are not created provided that the only phase shifts 
used are those defined by Pi. 
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As an example, let us choose anAl=4 element array with d — A/2. 
We can use Eq. (4.56) to produce N fixed beams and plot the results 
using MATLAB. These beams are sometimes referred to as scalloped 
beams because of the resemblance to the scallop shell. Since N — 4, 
then t = — — i, i, |. Substituting these values into the equation for 

sin 6 e , we get the polar plot as shown in Fig. 4.29. 

These beams can be created using fixed phase shifters by noting that 
fit = In = ± g , ± 3 2 t . Figure 4.30 depicts a Butler matrix labyrinth of 
phase shifters for N — 4-array elements. 

As can be noted from the figure, the 1 R port will yield the first 
beam to the right of broadside to the array. The 2 R port yields the 
second beam to the right and so on. Thus, through the use of the Butler 
matrix labyrinth, one can simultaneously look in N directions with an 
W-element array. 

4.6 Fixed Sidelobe Canceling 

The basic goal of a fixed sidelobe canceller (SLC) is to choose array 
weights such that a null is placed in the direction of interference while 
the mainlobe maximum is in the direction of interest. The concept of 
an SLC was first presented by Howells in 1965 [20]. Since that time 
adaptive SLCs have been extensively studied. A detailed description 
for an adaptive SLC will be reserved for Chap. 8. 

In our current development, we will discuss fixed sidelobe canceling 
for one fixed known desired source and two fixed undesired interferers. 
All signals are assumed to operate at the same carrier frequency. Let 
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us assume a 3-element array with the desired signal and interferers as 
shown in Fig. 4.31. 

The array vector is given by 

aid) = [ e ~j kdsine 1 eJkdsinty ( 4 . 57) 

The, as yet to be determined, array weights are given by 

w T =[w\ w 2 w 3 ] (4.58) 

Therefore, the total array output from the summer is given as 

S = w T ■ a = Wl e- jkdsin0 +w 2 + w 3 e jkdsin9 (4.59) 
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Figure 4.31 The 3-element array with desired and inter- 
fering signals. 



The array output for the desired signal will be designated by Sd whereas 
the array output for the interfering signals will be designated by S\ 
and S 2 . Since there are three unknown weights, there must be three 
conditions satisfied. 

Condition 1: S D = Wl e~ jkds in ° D +w 2 + w 3 e jkdsin6D = 1 
Condition 2: Si = Wl e~ jkds ™ e ' + w 2 + w ?J e jkdsin<h = 0 
Condition 2: S 2 = w 1 e- jkdsin ° 2 +w 2 + w 3 e jkdaia0 2 = 0 



Condition 1 demands that So = 1 for the desired signal, allowing the 
desired signal to be received without modification. Conditions 2 and 3 
reject the undesired signals. These conditions can be recast in matrix 
form as 

"g— jkdsinOD ^ gjkdsinOi) 
g—jkds\nOi -j^ gjkdsinOi 
g—jkdsm.02 -j^ gjkdsm.62 





W\ 




1 ' 




w 2 


= 


0 




w 2 




0 



(4.60) 



One can invert the matrix to find the required complex weights w\, w 2 , 
and w 3 . As an example, if the desired signal is arriving from Oo = 0° 
while 6\ = —45° and 0 2 = 60°, the necessary weights can be calculated 
to be 



W\ 




'0.748 + 0.094i' 


w 2 


= 


-0.496 


_w 2 _ 




0.748 - 0.094i_ 



(4.61) 



The array factor is shown plotted in Fig. 4.32. 
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Figure 4.32 Sidelobe cancellation. 



There are some limitations to this scheme. The number of nulls 
cannot exceed the number of array elements. In addition, the array 
maximum cannot be closer to a null than the array resolution allowed. 
The array resolution is inversely proportional to the array length. 

4.7 Retrodirective Arrays 

Retrodirective arrays are the array equivalent of a corner reflector. Cor- 
ner reflectors can be studied in any standard antenna book. It was 
Van Atta [21, 22] who invented a scheme to convert a linear array 
into a reflector. In this case, the array redirects in the incident field 
in the direction of arrival. Thus, the use of the term “retrodirective” 
is appropriate. Retrodirective arrays are sometimes also called self- 
phasing arrays, self-focusing arrays, conjugate matched arrays, or time- 
reversal mirrors [23-25]. A conjugate matched array is retrodirective 
because it retransmits the signal with the phases conjugated. If the 
phases are conjugated, it is the same as reversing time in the time 
domain. This is why retrodirective arrays are sometimes called “time- 
reversal” arrays. The acoustic community has aggressively pursued 
time-reversal methods in underwater acoustics as a means for address- 
ing the multipath challenge [26] . 

It is not necessary that the array is linear in order for the array to be 
retrodirective. In fact the Van Atta array is a special case of the more 
general subject of self-phasing arrays. However, in this development, 
we will restrict our discussion to linear arrays. 
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One of the obvious advantages of a retrodirective array is the fact 
that if the array can redirect energy in the direction of arrival then the 
array will work extremely well in a multipath environment. Some sug- 
gested ways of using a retrodirective array in mobile communications 
are given by Fusco and Karode [24]. If the same signal arrives from 
multiple directions, a retrodirective array will retransmit at the same 
angles and the signal will return to the source as if multipath did not 
exist. Figure 4.33 shows a linear retrodirective array in a multipath 
environment. 

If indeed the retrodirective array can retransmit along the angles of 
arrival then the retransmitted signal will retrace the multiple paths 
back to the transmitter. 



4.7.1 Passive retrodirective array 

One possible way of implementing a retrodirective array is shown in 
Fig. 4.34 for an N = 6 element array. A plane wave is incident on the 
array at the angle Oq. 

The array vector for this A^-element array is given as 



g— j^kdsm9 g —j | Ad sin 0 



gj^kdsinO 



gj ^kdsinO 



i T 



(4.62) 




Equal length 
cables 



Figure 4.34 Retrodirective array. 
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The received array vector, at angle do, is given as 



a rec = | g-./! sin 00 g-0'pdsin0 o 



gj\kd sin Gq 



gj | kd sin 6 



(4.63) 



The input to element 6, e jl kdsm0 o } propagates down the transmission 
line to element 1 and is retransmitted. The same process is repeated for 
all elements. Thus the transmitted signal for element i was the received 
signal for element N — i. This can be shown to be the same as multi- 
plying the array vector a in Eq. (4.62) by the reverse of the vector d re cOf 
Eq. (4.63). Reversing the vector of Eq. (4.63) is the same as reversing 
or flipping the individual elements. One method of reversing vector ele- 
ments is through the use of the permutation matrix. In MATLAB, this 
function can be accomplished through the use of the fliplr( ) command. 
The array transmission can be now calculated to be 



AF = fg-0'|^sin0 g-j|fedsin0 



g7‘|^dsin0o 

g7'|^dsin0o 



gJl&dsine g_/|Msin0"| ^ 



g —j\kd sin 0o 
g — 0 \kd sin0 o 



(4.64) 



Based on our derivation in Eq. (4.13) this is equivalent to a beamsteered 
array factor. 

A F= sin <y< sin ‘ ) - sin<, °;) (4.65) 

sin (“(sin 6 — sin0o)) 

Thus, this retrodirective array has successfully retransmitted the signal 
back toward the do direction. This process works regardless of the angle 
of arrival (AOA). Therefore, the retrodirective array serves to focus the 
reflected signal back at the source. 



4.7.2 Active retrodirective array 

A second method for self-phasing or phase conjugation is achieved 
through mixing the received signal with a local oscillator. In this case, 
the local oscillator will be twice the carrier frequency. Although it is not 
necessary to make this restriction, the analysis is easiest for this case. 
Each antenna output has its own mixer as shown in Fig. 4.35. 

The output of the nth antenna is V n (t, d n ), given by 



R,l(t, d n ) = COS(a> 0 £ + dn) 



(4.66) 
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Figure 4.35 Phase conjugation through heterodyne mixing. 



The mixer output is given by 

*S> m ix = C0s(wo t T On) ' COS(2n>o£) 

= ^ [cos(— a> 0 £ + On) + cos(3tt>ot + 6 n )] (4.67) 

After passing through the low pass filter and selecting the lower side- 
band, the transmitted signal, for element n, is given by 

T n (t, On) = cos(a>ot - O n ) (4.68) 

Thus, the phase has been conjugated from the phase imposed at ar- 
rival. This array will then redirect the signal back toward the AOA do- 
Since, the term in Eq. (4.68) could have been written as cos(— coot + O n ). 
It can thus be seen how this procedure is alternatively called time- 
reversal and how the array is called a time-reversal mirror. If we choose 
a different local oscillator frequency, it is sufficient to choose the lower 
sideband for retransmission. 
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Problems 

4.1 Use Eq. (4.14) and MATLAB and plot the array factor in rectangular co- 
ordinate form for the following broadside array cases 

(a) N = 4, d = X/2 
l b ) N= 8,d= X/2 
(c) N = 8, d = X 

4.2 For the three arrays given in Prob. 4.1, calculate d nu u and 6 S for each of 
the arrays given above. 



4.3 Repeat Prob. 4.1 above but for end-fire arrays. 
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4.4 Use MATLAB and the command trapz( ) to calculate the maximum direc- 
tivity for the following two arrays: 

(а) Broadside with N= 8, d = X/2 

(б) End-fire with N = 8, d = X/2 

4.5 Use MATLAB and the command trapzl ) to calculate the maximum direc- 
tivity for the beamsteered array where d = X/2, N = 8. 

(а) 9 0 = 30° 

(б) 9 0 = 45° 



4.6 What is the beamwidth for the following array parameters? 

(а) #o = 0° , N = 8, d = X/2 

(б) 0 O = 45°, N = 8, d = X/2 
(c) <9 0 = 90 o ,W=8, d = X/2 



4.7 For an N = 6, d = X/2 uniformly weighted broadside array, plot the array 
factor. Superimpose plots of the same array with the following weights. (Create 
one new plot for each new set of weights) 

(a) Kaiser-Bessel for a = 2 using kaiser(A r , a) 

(b) Blackman-Harris using blackmanharris(N) 

(c) Nuttall using nuttallwin(N) 

( d ) Chebyshev window for R = 30 dB using chebwin(N, R) 

4.8 Repeat Prob. 4.7 for N = 9, d = X/2. 

4.9 Using MATLAB, create and superimpose three normalized array factor 
plots using the chebwin( ) function for R = 20, 40, and 60 dB. N = 9, d = X/2 

4.10 Using MATLAB, create and superimpose three normalized array factor 
plots using the chebwin( ) function for R = 40 and beamsteer the array to three 
angles such that 9 0 = 0°, 30°, 60°. N=9,d = X/2. 

4.11 For d = X/2, use MATLAB and plot the broadside array beamwidth vs. 
element number N for 2 < N < 20. 

4.12 For d = X/2, use MATLAB and plot the N = 8 element array beamwidth 
for a range of steering angles such that 0 < 6q < 90° . 

4.13 For the N = 40 element circular array with radius a = 2X. Use MAT- 
LAB to plot the elevation plane pattern in the x-z plane when the array is 
beamsteered to 9q = 20°, 40°, and 60°, 4>o — 0- Plot for (0 < 9 < 180°) 

4.14 For the N = 40 element circular array with radius a = 2X. Use MATLAB 
to plot the azimuth plane pattern in the x-y plane when the array is beamsteered 
to 9q = 90°, 4>o = 0°, 40°, and 60°. Plot for (—90° < <j> < 90°) 

4.15 Design and plot a 5 x 5 element array with equal element spacing such 
that d x = d y = .5X. Let the array be beamsteered to 9q = 45° and </>o = 90°. 
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The element weights are chosen to be the Blackman-Harris weights using 
the blackmanharris( ) command in MATLAB. Plot the pattern for the range 

0 < 0 < 7r /2 and 0 < <p < 2n. 

4.16 Use Eq. (4.56) to create scalloped beams for the N — 6-element array 
with d = X/ 2. 

(a) What are the V values? 

(b) What are the angles of the scalloped beams? 

(c) Plot and superimpose all beams on a polar plot similar to Fig. 4.29 

4.17 For the fixed beam sidelobe canceller in Sec. 4.8, with N = 3-antenna 
elements, calculate the array weights to receive the desired signal at do = 30°, 
and to suppress the interfering signals arriving at = —30° and 62 = —60°. 
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Chapter 

5 

Principles of Random 
Variables and Processes 



Every wireless communication system or radar system must take into 
account the noise-like nature of the arriving signals as well as the in- 
ternal system noise. The arriving intelligent signals are usually altered 
by propagation, spherical spreading, absorption, diffraction, scattering, 
and/or reflection from various objects. This being the case, it is impor- 
tant to know the statistical properties of the propagation channel as 
well as the statistical properties of the noise internal to the system. 
Chapter 6 will address the subject of multipath propagation, which will 
be seen to be a random process. Chapters 7 and 8 will deal with system 
noise as well as the statistical properties of arriving signals. In addition, 
methodologies used in the remaining chapters will require computa- 
tions based upon the assumption that the signals and noise are random. 

It is thus assumed that students or researchers using this book have 
a working familiarity with random processes. Typically this material 
is covered in any undergraduate course in statistical topics or com- 
munications. However, for the purposes of consistency in this text, we 
will perform a brief review of some fundamentals of random processes. 
Subsequent chapters will apply these principles to specific problems. 

Several books are devoted to the subject of random processes includ- 
ing texts by Papoulis [1], Peebles [2], Thomas [3], Schwartz [4], and 
Haykin [5]. The treatment of the subject in this chapter is based upon 
the principles discussed in depth in these references. 

5.1 Definition of Random Variables 

In the context of communication systems, the received voltages, cur- 
rents, phases, time delays, and angles-of-arrival tend to be random 

105 
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variables. As an example, if one were to conduct measurements of the 
receiver phase every time the receiver is turned on, the numbers mea- 
sured would tend to be randomly distributed between 0 and 2tt. One 
could not say with certainty what value the next measurement will 
produce but one could state the probability of getting a certain mea- 
sured value. A random variable is a function that describes all pos- 
sible outcomes of an experiment. In general, some values of a ran- 
dom variable are more likely to be measured than other values. The 
probability of getting a specific number when rolling a die is equal 
to the probability of any other number. However, most random vari- 
ables in communications problems do not have equally likely proba- 
bilities. Random variables can either be discrete or continuous vari- 
ables. A random variable is discrete if the variable can only take on 
a finite number of values during an observation interval. An exam- 
ple of a discrete random variable might be the arrival angle for indoor 
multipath propagation. A random variable is continuous if the vari- 
able can take on a continuum of values during an observation inter- 
val. An example of a continuous random variable might be the voltage 
associated with receiver noise or the phase of an arriving signal. Since 
random variables are the result of random phenomena, it is often best 
to describe the behavior of random variables using probability density 
functions. 



5.2 Probability Density Functions 

Every random variable x is characterized by a probability density func- 
tion p(x). The probability density function (pdf) is established after 
a large number of measurements have been performed, which deter- 
mine the likelihood of all possible values of x. A discrete random vari- 
able possesses a discrete pdf. A continuous random variable possesses 
a continuous pdf. Figure 5.1 shows a typical pdf for a discrete ran- 
dom variable. Figure 5.2 shows a typical pdf for a continuous random 
variable. 

The probability that x will take on a range of values between two 
limits xi and X 2 is defined by 



P(xi < x < X 2 ) = 



r* 2 



p(x)dx 



> X\ 



(5.1) 



There are two important properties for pdfs. First, no event can have 
a negative probability. Thus 



p{x) > 0 



(5.2) 
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Figure 5.1 Probability density function for discrete x values. 



Second, the probability that an x value exists somewhere over its range 
of values is certain. Thus 



p(x) dx — 1 



( 5 . 3 ) 



Both properties must be satisfied by any pdf. Since the total area under 
the pdf is equal to 1, the probability of x existing over a finite range of 
possible values is always less than 1. 




Figure 5.2 Probability density function for continuous x values. 
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5.3 Expectation and Moments 

It is valuable to understand various properties of the random variable x 
or various properties of functions of the random variable x. The most 
obvious property is the statistical average. The statistical average is 
defined as the expected value denoted by E. Thus, the expected value 
of x is defined as 

/ OO 

xp(x)dx (5.4) 

-OO 

Not only can we find the expected value of x but we can also find the 
expected value of any function of x. Thus 

/ OO 

f(x)p{x)dx (5.5) 

-OO 



The function of x could be x 2 , x 3 , cos(x), or any other operation on the 
random variable. The expected value of x is typically called the first 
moment denoted as m\. 



m\ 



xp(x)dx 



(5.6) 



The nth moment is defined as the expected value of x n , thus, 

/ OO 

x n p{x)dx (5.7) 



The concept of moments is borrowed from the terminology of moments 
in mechanics. 

If the random variable was expressed in volts, the first moment would 
correspond to the average, mean, or dc voltage. The second moment 
would correspond to the average power. 

The spreading about the first moment is called the variance and is 
defined as 

/ OO 

(x — m\) 2 p(x) dx (5.8) 

-OO 



The standard deviation is denoted by a and is defined as the spread 
about the mean, thus, 



a = \j~g2 (5.9) 

By expanding the squared term in Eq. (5.8) it can be shown that a — 
m 2 — m 2 . 

The first moment and standard deviation tend to be the most use- 
ful descriptors of the behavior of a random variable. However, other 
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moments may need to be calculated to understand more fully the 
behavior of a given random variable x. Since the calculation of each 
new moment requires a reevaluation Eq. (5.7), sometimes it is useful to 
utilize the moment generating function that will simplify the calcula- 
tion of multiple moments. The moment generating function is defined as 

/ OO 

e sx p(x) dx — Fis) (5.10) 

-OO 

The moment generating function resembles the Laplace transform of 
the pdf. This now brings us to the moment theorem. If we differentiate 
Eq. (5.10) n times with respect to s, it can be shown that 

F'\s) = E[x n e sx ] (5.11) 



Thus, when s = 0, we can derive the nth moment as shown in Eq. (5.12) 

F n (0) = E[x n ) = m n (5.12) 

Example 5.1 If the discrete pdf of the random variable is given by p(x) = 
.5 [Six + 1) + Six — 1)], what are the first three moments using the moment 
generating function Fis)? 

Solution Finding the moment generating function we have 

/ OO 

e sx (.5)[<5(x + 1) + <5(x — 1 )\dx = .5[e~ s + e s ] 

-OO 

= cosh(s) 

The first moment is given as 



m l = F 1 (s )| s=0 = 



dcosh(s) 

ds 



= sinh(O) = 0 



s = 0 



The second moment is given as 

m 2 — F 2 is)\ s= o = cosh(O) = 1 
The third moment is given as 

m 3 = F 3 is) | s= o = sinh(O) = 0 



5.4 Common Probability Density Functions 

There are numerous pdfs commonly used in both radar, sonar, and 
communications. These pdfs describe the characteristics of the receiver 
noise, the arriving signal from multipaths, the distribution of the phase, 
envelope, and power of arriving signals. A quick summary of these pdfs 
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and their behavior will be useful in supporting concepts addressed in 
Chaps. 6, 7, and 8. 

5.4.1 Gaussian density 

The Gaussian or normal probability density is perhaps the most com- 
mon pdf. The Gaussian distribution generally defines the behavior of 
noise in receivers and also the nature of the random amplitudes of 
arriving multipath signals. According to the Central Limit Theorem, 
the sum of numerous continuous random variables as the number in- 
creases, tends toward a Gaussian distribution. The Gaussian density is 
defined as 



This is a bell-shaped curve, which is symmetric about the mean 
value x 0 , and has a standard deviation of a. A plot of a typical Gaussian 
distribution is shown in Fig. 5.3. 

Example 5.2 For the Gaussian probability density function with x 0 = 0 and 
o = 2, calculate the probability that x will exist over the range 0 < x < 4. 

Solution Invoking Eqs. (5.1) and (5.13) we can find the probability 




e 2 » 2 — oo < x < oo 



(5.13) 




- L e -* 2 /8 = .477 



H, 

Ci. 




X 



Figure 5.3 Gaussian density function. 
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H, 

O, 




X 



Figure 5.4 Rayleigh density function. 



5.4.2 Rayleigh density 

The Rayleigh probability density generally results when one finds the 
envelope of two independent Gaussian processes. This envelope can be 
found at the output of a linear filter where the inputs are Gaussian 
random variables. Rayleigh distributions are normally attributed to 
the envelope of multipath signals when there is no direct path. The 
Rayleigh distribution is defined as 



The standard deviation can be shown to be a . A plot of a typical Rayleigh 
distribution is shown in Fig. 5.4. 

Example 5.3 For the Rayleigh probability density function with a = 2, cal- 
culate the probability that x will exist over the range 0 < x < 4. 

Solution Invoking Eqs. (5.1) and (5.14) we can find the probability 



5.4.3 Uniform density 

The uniform distribution is normally attributed to the distribution of 
the random phase for propagating signals. Not only does the phase 
delay tend to be uniformly distributed but often the angles of arrival 




(5.14) 




•4 



P(0 < x < 4) = 
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for diverse propagating waves can also take on a uniform distribution. 
The uniform distribution is defined as 



This mean value can be shown to be (a + b)/ 2. A plot of a typical uniform 
distribution is shown in Fig. 5.5. 

Example 5.4 For the uniform distribution where a — —2, and 6 = 2, use the 
moment generating function to find the first three moments. 

Solution By substituting Eq. (5.15) into Eq. (5.10) we have 



p(x) = [u{x — a) — u(x — b)] a < x < b 

b — a 



(5.15) 




= — sinh(2s) 
2s 

The first moment is found as 




sinh(2s) 

2s 2 



The second moment is found as 




2cosh(2s) 4 



s s 3 



s 2 3 



h, 

a. 



a 



b 



x 



Figure 5.5 Uniform density function. 
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The third moment is found as 



?n 3 = ,F 3 (s)| s= o = cosh(2s) 



'4 6 ' 




'6 3 


s s d 


— sinh(2s) 


s 2 s 4 



= 0 



5.4.4 Exponential density 

The exponential density function is sometimes used to describe the 
angles of arrival for incoming signals. It can also be used to describe 
the power distribution for a Rayleigh process. The Exponential density 
is the Erlang density when n — 1 ([1]) and is defined by 

p(x) = —e~ xla x > 0 (5.16) 

a 

The mean value can be shown to be a . The standard deviation can 
also be shown to be a . The literature sometimes replaces cr with 2<r 2 
in Eq. (5.16). A plot of a typical exponential distribution is shown in 
Fig. 5.6. 



Example 5.5 Using the exponential density with a = 2, calculate the prob- 
ability that 2 < x < 4 and find the first two moments using the moment 
generating function. 

Solution The probability is given as 

/*4 1 

P{2 < x < 4) = / -e~ xa/a) dx = .233 
J 2 a 




Figure 5.6 Exponential density function. 
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The moment generating function is derived as 

0° -j 

F(s) = / -e~ x ^- s) dx = 

7o a l- as 

Finding the first moment 



FHs) = 



(1 — as) 2 



= a =2 



s=0 



Finding the second moment 



F 21 (s) = 



2<t^ 

(1 — crs) 3 



s=0 



= 2a z = 8 



5.4.5 Rician density 

The Rician distribution is common for propagation channels where 
there is a direct path signal added to the multipath signals. The 
direct path inserts a nonrandom carrier thereby modifying the Rayleigh 
distribution. Details of the derivation of the Rician distribution can be 
found in [1, 4], The Rician distribution is defined as 

x (x 2 +a 2 ) ( xA\ 

?(*) = - 2 e 2 " 2 fo ( ~^2 ) x>0, A>0 (5.17) 

where /o( ) is the Modified Bessel function of first kind and zero-order. 
A plot of a typical Rician distribution is shown in Fig. 5.7. 

Example 5.6 For the Rician distribution with a = 2 and A = 2, what is the 




X 



Figure 5.7 Rician density function. 
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x 



Figure 5.8 Laplace density function, 
probability that x > 5? 

Solution Using Eqs. (5.17) and (5.1) we have the probability as 
P{x > 5) = j ’ I 0 dx = .121 



5.4.6 Laplace density 

The Laplace density function is generally attributed to the distribution 
of indoor or congested urban angles of arrival. The Laplace distribution 
is given as 



p(x ) = 



V2 o 



,- 1^1 



— OO < X < oo 



(5.18) 



Since the Laplace distribution is symmetric about the origin, the first 
moment is zero. The second moment can be shown to be a 2 . A plot of 
the Laplace distribution is shown in Fig. 5.8. 

There are many other probability density functions described in the 
literature but the six functions mentioned in this chapter are the most 
common distribution functions applied to wireless communication 
problems. 



5.5 Stationarity and Ergodicity 

In realistic applications, we may know the statistical properties of the 
signals and noise but we are often confronted with the challenge of per- 
forming operations on a limited block of sampled data. If the statistical 
mean mi is the average value of the random variable x, one might 
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intuitively assume that the time average would be equal to the sta- 
tistical average. We can estimate the statistical average by using the 
time average over a block length T. The time average for the random 
variable x can be written as 

1 f T 

x — — / x(t)dt (5.19) 

T Jo 

where x is the estimate of the statistical average of x. 

If the data is sampled data, Eq. (5.19) can be rewritten as a series to 
be expressed as 




K 

Yxik) 

k=i 



(5.20) 



Since the random variable x(t) is changing with time, one might 
expect that the estimate x might also vary with time depending on 
the block length T. Since we are performing a linear operation on the 
random variable x, we will produce a new random variable x. One might 
expect that the time average and the statistical average would be sim- 
ilar if not identical. We can take the expected value of both sides of 
Eq. (5.19) 



E[x] = 



— / E[x(t)]dt 
t Jo 



(5.21) 



If all statistics of the random variable x do not change with time, the 
random process is said to be strict-sense stationary [1], A strict sense 
stationary process is one in which the statistical properties are invari- 
ant to a shift in the time origin. If the mean value of a random variable 
does not change with time, the process is said to be wide-sense station- 
ary. If x is wide-sense stationary, Eq. (5.21) simplifies to 



E[x\ — E[x{t)] = m\ 



(5.22) 



In reality, the statistics might change for short blocks of time T but 
stabilize over longer blocks of time. If by increasing T (or K) we can 
force the time average estimate to converge to the statistical average, 
the process is said to be ergodic in the mean or mean-ergodic. This can 
be written as 

1 f T 

lim x = lim — / x{t)dt — mi (5.23) 

T~*o o T—>oo T J 0 

or 



1 

lim x = lim — > x(k ) = m\ 

T—>oo T— >oo K ' 

k=l 



(5.24) 
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In a similar way, we can also use a time average to estimate the variance 
of x defined as 

i r T 

erf = — J ( x(t)-x) 2 dt (5.25) 

If the data is sampled data, Eq. (5.25) can be rewritten as a series to be 
expressed as 



d 2 = ^XxUi) - x) 2 (5.26) 

A k=i 

If by increasing T (or K) we can force the variance estimate to con- 
verge to the statistical variance, the process is said to be ergodic in the 
variance or variance-ergodic . This can be written as 

(x(t) — x)dt — erf (5.27) 

or 

1 K 

lim <f 2 = lim — S^ixik) - x) 2 = erf (5.28) 

k=l 

In summary, stationary processes are ones in which the statistics of 
the random variables do not change at different times. Ergodic pro- 
cesses are ones where it is possible to estimate the statistics, such as 
mean, variance, and autocorrelation, from the measured values in time. 
Stationarity and ergodicity will prove valuable in practical communi- 
cation systems because, under certain conditions, one can reliably esti- 
mate the mean, variance, and other parameters based upon computing 
the time averages. 

5.6 Autocorrelation and Power 
Spectral Density 

It is valuable to know how well a random variable correlates with itself 
at different points in time. That is, how does x at the time t\ correlate 
with x at the time t‘p. This correlation is defined as an autocorrelation 
since we are correlating x with itself. The autocorrelation is normally 
written as 



lim 



= lim — 

T-+ oo T 



R x (ti, h) — E[x(ti)x(h)] (5.29) 

If the random variable x is wide-sense stationary, the specific values 
of t\ and t ‘2 are not as important as the time interval between these 
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two values defined by r. Thus, for a wide-sense stationary process, the 
autocorrelation can be rewritten as 

R x (x) = E[x(t)x(t + r)] (5.30) 

It should be noted that the autocorrelation value at r = 0 is the second 
moment. Thus, 

R x ( 0) = E[x 2 ] — m 2 - 

Again, in practical systems where we are constrained to process lim- 
ited blocks of data, one is forced to estimate the autocorrelation based 
upon using a time average. Therefore, the estimate of the autocorrela- 
tion can be defined as 

1 f T 

Rx(r) = — / x(t)x(t + x)dt (5.31) 

1 J o 

If the data is sampled data, Eq. (5.31) can be rewritten as a series to 
be expressed as 

1 K 

R x ( n ) = — x{k)x{k + n) (5.32) 

k=l 

If by increasing T (or K) we can force the autocorrelation estimate 
to converge to the statistical autocorrelation, the process is said to be 
ergodic in the autocorrelation or autocorrelation-ergodic. This can be 
written as 

1 ( T 

lim R x ( x) = lim — / x(t)x(t + x) dt — R x (x) (5.33) 

T-^oo T— >oo 1 J q 

It should be noted that the units of the autocorrelation function for 
electrical systems are normally expressed in watts. Thus Rx( 0) yields 
the average power of the random variable x. 

As with normal signals and linear systems, it is instructive to under- 
stand the behavior of the spectrum of the random variable x. Such pa- 
rameters as bandwidth and center frequency help the system designer 
to understand how to best process the desired signal. The autocorrela- 
tion itself is a function of the time delay between two time-separated 
random variables. Thus, the autocorrelation is subject to Fourier anal- 
ysis. Let us define the power spectral density as the Fourier transform 
of the autocorrelation function. 

/ OO 

R x (x)e- j2nfr dx (5.34) 

-OO 

/ OO 

S x (f)e j2wfT df (5.35) 

-OO 
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The Fourier transform pair in Eqs. (5.34) and (5.35) is frequently re- 
ferred to as the Wiener -Khinchin pair [6] . 

5.7 Correlation Matrix 

In the previous treatment of random variables, we assumed that only 
one random variable x existed and we performed expectation operations 
on these scalar values. Several circumstances arise when a collection of 
random variables exist. One such example is the output of each element 
of an antenna array. If an incoming planewave induces a random voltage 
on all M-array elements, the received signal x is a vector. Using the 
notation of Chap. 4 we can describe the array element output voltages 
for one incident planewave. 



x(t) = aid) ■ sit) (5.36) 

where sit) — incident monochromatic signal at time t 
aid) — M-element array steering vector for the 9 
direction of arrival. 

Let us now define the M x M-array correlation matrix R xx as 

Rxx = E[x ■ x H ] — E[(as)(s*a ff )] 

= aE[\s\ 2 ]a H (5.37) 

= Sa -a H 

where ( ) H indicates the Hermitian transpose and S — E[|s| 2 ]. 

The correlation matrix in Eq. (5.37) assumes that we are calculating 
the ensemble average using the expectation operator E[ ]. It should be 
noted that this is not a vector autocorrelation because we have imposed 
no time delay in the vector x. 

For realistic systems where we have a finite data block, we must re- 
sort to estimating the correlation matrix using a time average. There- 
fore, we can re-express the operation in Eq. (5.37) as 

R X x — 4 [ xit ) ■ xit) H dt = - ^ [ \s(t)\ 2 dt (5.38) 

f JO 7 J 0 

If the data is sampled data, Eq. (5.38) can be rewritten as a series to be 
expressed as 



Rxx — 



-H K 

-£ 

k=i 



sik)\< 



K 



(5.39) 
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If by increasing T (or K) we can force the correlation matrix estimate 
to converge to the statistical correlation matrix, the process is said to 
be ergodic in the correlation matrix. This can be written as 

1 r T _ 

lim R xx i t) = lim — / x(t)x(t) H dt — R xx (5.40) 

T->oo T—>o o 1 J q 

The correlation matrix will be heavily used in Chaps. 7 and 8. 
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Problems 

5.1 For the discrete probability density function given as 

p(x) = ^ [5(jc) + Six - 1) + S(x - 2)] 
o 

ia) What is the moment generating function F(s ) 

(b) Calculate the first two moments using Eq. (5.7) 

(c) Calculate the first two moments using the moment generating function 
and Eq. (5.12) 

5.2 For the Gaussian density with a = 1, and xq = 3 

(a) Use MATLAB and plot the function for — 10 < x < 10. 

(b) What is the probability Fix < 2)? 

5.3 For the Rayleigh density with a = 2 

(a) Use MATLAB and plot the function for 0 < x < 10. 

ib) What is the probability Fix > 2)? 

5.4 For the uniform density with a = 0, b = 5 

ia) Find the first two moments using Eq. (5.7) 

ib) Find the first two moments using Eq. (5.12) 

5.5 For the exponential density with o = 2 

(a) Use MATLAB and plot the function for 0 < x < 5. 
ib) What is the probability Fix > 2)? 
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5.6 For the Rician density with a = 3 and A = 5 

(а) Use MATLAB and plot the function for 0 < x < 10. 

(б) What is the probability P(x > 5)? 

5.7 For the Laplace density with a = 2 

(а) Use MATLAB and plot the function for — 6 < x < 6. 

(б) What is the probability P(x < —2)? 

5.8 Create the 30 sample zero mean Gaussian random variable x using 
MATLAB such that the a = 2. This is done by the command x = cr*randn 
(1, 30). This is a discrete time series of block length K = 30. 

(а) Use Eq. (5.20) to estimate the mean value 

(б) Use Eq. (5.25) to estimate the standard deviation a x 

(c) What is the percent error between these estimates and the mean and 
standard deviation for the true Gaussian process? 

5.9 Use the same sequence from Prob. 8 

(а) Use the MATLAB xcorr( ) command to calculate and plot the autocor- 
relation Rx(n) ofx. 

(б) Use the FFT command and fftshift to calculate the power spectral 
density of RAn). Plot the absolute value. 

5.10 For the N= 2 element array with elements spaced A/2 apart 

(а) What is the array steering vector for the 9 — 30° ? 

(б) Define the time signal impinging upon the array as s(t) = 2exp(nt/T). 
Use Eq. (5.38) and calculate the array correlation matrix. 
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Chapter 

6 

Propagation Channel 
Characteristics 



Free-space transmission occurs when the received signal is exclusively 
the result of direct path propagation. In this case there is no interfer- 
ence at the receiver caused by multipath signals. The received signal 
strength calculations are straightforward and deterministic. The free- 
space transmission model is a useful construct which can be used to 
understand fundamental propagation behavior. However, the free-space 
model is unrealistic because it fails to account for the numerous terres- 
trial effects of multipath propagation. It is normally assumed in this 
chapter that the propagation channel includes at least two propagation 
paths. 

A channel is defined as the communication path between transmit 
and receive antennas. The channel accounts for all possible propaga- 
tion paths as well as the effects of absorption, spherical spreading, 
attenuation, reflection losses, Faraday rotation, scintillation, polariza- 
tion dependence, delay spread, angular spread, Doppler spread, disper- 
sion, interference, motion, and fading. It may not be necessary that any 
one channel has all of the above effects but often channels have multi- 
ple influences on communication waveforms. Obviously, the complexity 
of the channel increases as the number of available propagation paths 
increases. It also becomes more complex if one or more variables vary 
with time such as the receiver or transmitter position. Several excellent 
texts exist which describe channel characteristics [1—9]. 

Indoor propagation modeling can be a formidable challenge. This 
is partly due to the regular and periodic location of structures such 
as windows, doors, wall studs, ceiling tiles, electrical conduits, ducts, 
and plumbing. This is also partly due to the very close proximity of 
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scattering objects relative to the transmitter and/or receiver. An excel- 
lent treatment of indoor propagation channels can be found in Sarkar 
et al. [5], Shankar [3], and Rappaport [9]. This chapter will preview 
channel characterization basics but will assume outdoor propagation 
conditions. 



6.1 Flat Earth Model 

In Chap. 2 we discussed propagation over flat earth. This propagation 
model was simplistic but it demonstrated the elements of more com- 
plex propagation scenarios. We will repeat Fig. 2.17 as Fig. 6.1. The 
transmitter is at height hi while the receiver is at height h 2 . The direct 
path term has path length r\. There is also a reflected term due to the 
presence of a ground plane, this is called the indirect path. Its overall 
path length is r 2 . The ground causes a reflection at the point y with a 
reflection coefficient R. The reflection coefficient R is normally complex 
and can alternatively be expressed as R = \R\e J '-' . The reflection coeffi- 
cient can be found from the Fresnel reflection coefficients as described 
in Eq. (2.63) or (2.69). The reflection coefficient is polarization depen- 
dent. One expression is used for parallel polarization where the E field 
is parallel to the plane of incidence (E is perpendicular to the ground). 
The other expression is for perpendicular polarization where the E field 
is perpendicular to the plane of incidence (E is parallel to the ground). 

Through simple algebra, it can be shown that 



ri = \Jd 2 + (h 2 - hi) 2 


(6.1) 


r 2 — \/d 2 + (h 2 + hi) 2 


(6.2) 




Figure 6.1 Propagation over flat earth. 
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The total received phasor field can be found to be 

+ (6.3, 
r\ i"2 

where E rs — phasor representation of the received electric field E r . 

The reflection point is given by y — dh\l(h\ +h 2 ). Therefore, the angles 



of arrival may be calculated to be 






6i — tan -1 | 


ni 2 -h 1 \ 


(6.4) 


V d ) 


02 = tan -1 | 


( h-2 + hi \ 


(6.5) 


V d ) 



Lastly, the time delays of arrival are given by n = ^ and t 2 = y, where 
c is the speed of propagation. 

If we make the assumption that the antenna heights hi, h-> <5C r, , r 2 , 
we can use a binomial expansion on Eqs. (6.1) and (6.2) to simplify the 
distance expressions. 



n — Vd 2 + U12 - hi) 2 ss d + ^ 1 2 f 1 * 

2d 



r 2 — Vd 2 + U 12 + hi) 2 ss d + 



(/i 2 + /h) 2 
2d 



(6.6) 

(6.7) 



If we additionally assume that r\ ~ r 2 , we can substitute Eqs. (6.6) and 
(6.7) into Eq. (6.3) to get 



E rt< — 



E 0 e 



-jkn r 



r i 

E 0 e~’ kr i 

n 

E 0 e-’ kn 

n 



1 + Re- Jk(r2 ~ ri) 

1 + |i?|e 






(6.8) 



1 + |iJ| ( cos ( k 21 ^ - f 



. . , 2h\ii2 
■ J sin ( k— f 



Equation (6.8) is in phasor form. We can transform this phasor back into 
the time domain by finding Re\E rf; e J " d }. Through the use of Euler iden- 
tities and some manipulation, we can write the time domain expression 
as 



E r (t) = — 
n 



|i?| cos (k X ^ 2 - ijf'j ^ cos(ad — kr\) 



2 h Ji 

|i?| sin ( k — — \fr ) sin(ftd — kr\) 



(6.9) 
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This solution is composed of two quadrature sinusoidal signals har- 
monically interacting with each other. If R = 0, Eq. (6.9) reverts to the 
direct path solution. Equation (6.9) is of the general form 



Xcos (cot — kr \ ) + Y sin(«;£ — kr{) = Acos(o>£ — kri + <p) (6.10) 

where X = f»(l + |i? | cos(&^ - f)) 

T = ^\R\sin{k^ - f) 

A — \/X 2 + Y 2 = signal envelope 
< p — tan -1 (^) = signal phase 

Thus, the envelope and phase of Eq. (6.9) is given by 



E 0 H 7 {2kh 1 h 2 ( ~ . / 2kh\h 2 

A= ttV ( 1 + |fl|cos (~^ *)) + (' |sm (^r~ 

,( sin (**¥•-*) \ 

(, b — tan 1 — 

Vf (l + 1*1 cos ) 

where ^ = direct path amplitude. 




( 6 . 11 ) 

( 6 . 12 ) 




Direct path 
amplitude 



Figure 6.2 Envelope of direct and reflected signals over flat earth. 
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Direct path 
phase 



Figure 6.3 Phase of direct and reflected signals over flat earth. 



Example 6.1 Assuming the following values: |i?| = . 3, .6, .9, f = 0, E 0 /r i = 1, 
hi — 5 m, h ,2 — 20 in, d — 100 m . What are the time delays t\ and Also, 
plot a family of curves for the envelope and the phase vs. ( 2 khih 2 )/d 

Solution Using Eqs. (6.6) and (6.7) we can calculate the distances = 
101.12 m and r-i = 103.08 m. The corresponding time delays are x\ — .337 us 
and T 2 = .344 ps. The plots of the envelope and phase are shown in Figs. (6.11) 
and (6.12) 

Figures 6.2 and 6.3 demonstrate that if any one of the abscissa vari- 
ables change, whether antenna heights {hi, h 2 ), antenna distance id), 
or the wavenumber k, the received signal will undergo significant inter- 
ference effects. In general, the total received signal can fluctuate any- 
where between zero and up to twice the amplitude of the direct path 
component. The phase angle can change anywhere between —90° and 
90°. If antenna 1 represents a mobile phone in a moving vehicle, one 
can see how multipath can cause interference resulting in signal fad- 
ing. Fading is defined as the waxing and waning of the received signal 
strength. 

6.2 Multipath Propagation Mechanisms 

The flat earth model discussed earlier is a deterministic model that 
is instructive in demonstrating the principles of basic interference. In 
the case of the flat earth model, we know the received signal voltage 
at all times. In typical multipath/channel scenarios, the distribution 
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of large numbers of reflecting, diffracting, refracting, and scattering 
objects becomes random. In this case, numerous multiple paths can 
be created and it becomes extremely difficult to attempt to model the 
channel deterministically. Thus, one must revert to a statistical model 
for estimating signal and channel behavior. 

Figure 6.4 shows a propagation channel with several candidate mech- 
anisms for creating multiple propagation paths. Let us define these 
mechanisms. 

■ Scattering: Scattering occurs when the electromagnetic signal strikes 
objects that are much smaller than a wavelength. These objects could 
be water droplets, clouds, or insects for example. In the electromag- 
netic community this mechanism is often termed Rayleigh scattering. 
(This is not to be confused with Rayleigh fading although the two phe- 
nomena are interrelated). 

■ Refraction: Refraction occurs when an electromagnetic signal propa- 
gates through a structure. The propagation path is diverted because 
of the difference in the electrical properties of the medium. Boundary 
conditions help to determine the extent of the refraction. 

■ Reflection: Reflection occurs when an electromagnetic signal strikes 
a smooth surface at an angle and is reflected toward the receiver. The 
angle of reflection is equal to the angle of incidence under normal 
conditions. 

■ Diffraction: Diffraction occurs when the electromagnetic signal 
strikes an edge or corner of a structure that is large in terms of 
wavelength. The incident ray is diffracted in a cone of rays follow- 
ing Keller’s laws of diffraction [10]. 

The various mechanisms such as scattering, refraction, reflection, 
and diffraction give rise to alternate propagation paths such that the re- 
ceived signal is a composite of numerous replicas all differing in phase, 
amplitude, and in time delay. Thus, the multipath signal amplitudes, 
phases, and time delays become random variables. 




More Books Visit : www.iqbalkalmati.blogspot.com 



Propagation Channel Characteristics 129 



6.3 Propagation Channel Basics 

In order to better understand the performance of a wireless signal prop- 
agating in a typical outdoor environment, it is necessary to define some 
terms. These terms are commonly used to describe properties or char- 
acteristics of the channel. 

6.3.1 Fading 

Fading is a term used to describe the fluctuations in a received signal as 
a result of multipath components. Several replicas of the signal arrive 
at the receiver, having traversed different propagation paths, adding 
constructively and destructively. The fading can be defined as fast or 
slow fading. Additionally, fading can be defined as flat or frequency 
selective fading. 

Fast fading is propagation which is characterized by rapid fluctu- 
ations over very short distances. This fading is due to scattering from 
nearby objects and thus is termed small-scale fading. Typically fast fad- 
ing can be observed up to half-wavelength distances. When there is no 
direct path (line-of-sight), a Rayleigh distribution tends to best fit this 
fading scenario, thus fast fading is sometimes referred to as Rayleigh 
fading. When there is a direct path or a dominant path, fast fading can 
be modeled with a Rician distribution. 

Slow fading is propagation which is characterized by slow varia- 
tions in the mean value of the signal. This fading is due to scattering 
from the more distant and larger objects and thus is termed large-scale 
fading. Typically slow fading is the trend in signal amplitude as the 
mobile user travels over large distances relative to a wavelength. The 
slow fading mean value is generally found by averaging the signal over 
10 to 30 wavelengths [11]. A log-normal distribution tends to best fit 
this fading scenario, thus slow fading is sometimes referred to as log- 
normal fading. Figure 6.5 shows a superimposed plot of fast and slow 
fading. 

Flat fading is when the frequency response of the channel is flat 
relative to the frequency of the transmit signal, that is, the channel 
bandwidth Be is greater than the signal bandwidth BsiBc > Bs). Thus, 
the multipath characteristics of the channel preserve the signal quality 
at the receiver. 

Frequency selective fading is when the channel bandwidth B c is less 
than the signal bandwidth Bg (B c < B s ). In this case, the multipath 
delays start to become a significant portion of the transmit signal time 
duration and dispersion occurs. 

Fast fading is of particular interest to the electrical engineer because 
the resulting rapid fluctuations can cause severe problems in reliably 
maintaining communication. 
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Figure 6.5 Slow and fast fading examples. 

6.3.2 Fast fading modeling 

Multipath with no direct path. Based upon the scenario as given in Fig. 6.4, 
we will assume that no direct path exists but that the entire received 
electric field is based upon multipath propagation. We can express the 
received voltage phasor as the sum of all the possible multipath com- 
ponent voltages within the receiver. 



where a n — random amplitude of the nth path 

a n = random phase associated with the nth path 
r n = length of nth path 

(pn — n -}- OLji 

If we assume a large number of scattering structures N, which are 
randomly distributed, we can assume that the phases cp n are uniformly 
distributed. We can express the time-domain version of the received 
voltage as 



N 



N 




(6.13) 



N 

Vr = '^2a n cosicoot + <p n ) 



N 



N 



^a„cos(</>„)cos(tt> 0 £) - ^a„sin((/>„)sin(n>o£) (6.14) 
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We may further simplify Eq. (6. 14) as we did in Eq. (6.10) using a simple 
trigonometric identity 

u r — Xcos(a>ot) — Y sin(a>ot) = r cos(o>ot + (p) (6.15) 

where X = Y,n=i a « cos (</>„) 

Y = J2n=i a„sin(0„) 
r = y/X 2 + Y 2 — envelope 
</> = tan -1 (|) 

In the limit, as N —>■ oo, the Central Limit Theorem dictates that the 
random variables X and Y will follow a Gaussian distribution with zero 
mean and standard deviation a . The phase (j> can also be modeled as a 
uniform distribution such that p((p) = ^ for 0 < </> < 2 jt. The envelope 
r is the result of a transformation of the random variables X and Y and 
can be shown to follow a Rayleigh distribution as given by Schwarz [12] 
or Papoulis [13]. The Rayleigh probability density function is defined as 

r r 2 

p(r) — -^e 2 ° 2 r > 0 (6.16) 

a 2 

where a 2 is the variance of the Gaussian random variables X or Y. 

Figure 6.6 shows a Rayleigh distribution for two different standard 
deviations. Although the mean value of X or Y is zero, the mean value 
of the Rayleigh distribution is a \/tt/2. 




Envelope r (mV) 



Figure 6.6 Rayleigh probability density. 
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0 5 10 15 

Envelope r (mV) 

Figure 6.7 Rayleigh probability density with indicated threshold. 



Example 6.2 For the Rayleigh-fading channel where a = .003 V, what is 
the probability that the received voltage envelope will exceed a threshold of 
5 mV? 

Solution The probability of the envelope as exceeding 5 mV is given by 

/*OG 2 

Pir > .005) = / -=e~^dr = . 249 

J. 005 

This can be shown as the shaded area under the curve in the Fig. 6.7. 



It can be shown that if the envelope is Rayleigh distributed, the power 
p (watts) will have an exponential distribution [3, 13] (Also called an 
Erlang distribution with n — 1). 



pip) = —~^e 2„2 p > 0 (6.17) 

2 er z 

The average value of the power is given by 




Elp] = po = 



p ■ pi p)dp 
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Thus p 0 = 2 ct 2 and can be substituted in Eq. (6.17). The power distri- 
bution is plotted in Fig. 6.8 for p 0 = 2 pW, 4 pW. 

The minimum detectable threshold power in a receiver is Pth- This 
power level is dictated by the receiver noise floor, noise figure, and 
detector thresholds. If the received power falls below the threshold, the 
receiver goes into “outage” because the backend signal-to-noise ratio is 
insufficient. The outage probability is the probability that the received 
power is too small for detection. This is given by 

r p th i _ JL 

P(p<Pth)= / — e p « dp (6.19) 

Jo Po 

Example 6.3 What is the outage probability of a Rayleigh channel if the 

average power is 2 pW and the threshold power is 1 pW? 

Solution The outage probability is given by 

r 1 ^ i P 

P(p<lpW)= / — g _ 2irw dp — 0.393 

Jo 2 hW 

This can be shown as the shaded area under the curve in the Fig. 6.9. 

Multipath with direct path. Let us now consider that a direct path is 
present as indicated in Fig. 6.10. If a direct path is allowed in the 
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Figure 6.9 Outage probability shown as area under curve. 



received voltage, we must modify Eqs. (6.14) and (6.15) by adding the 
direct path term with the direct path amplitude A (volts). 

N 

V r — AcOS(&>ot) + 'Y2 O-n COSidOQt + </>„) 
n= 1 



N 

£ 

n= 1 



a n cos {(p n ) 



N 

cos (a> 0 t) — ^a„sin(0„) sin(a> 0 ^) 

n=l 



( 6 . 20 ) 




Figure 6.10 A multipath channel with a direct path. 
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Again, the envelope r = -J X 2 + Y' 2 . We now must accordingly revise 
the random variables X and Y. 



N 



X — A+ y^q, t cos(</>„) 



n= 1 



N 



( 6 . 21 ) 



Y = y^a n sin(<p n ) 



n= 1 



The random variable X is Gaussian with mean of A and standard devia- 
tion of a. Random variable Y is Gaussian with zero mean and standard 
deviation of a . The probability density function for the envelope is now 
a Rician distribution and is given by 



(r 2 +A 2 ) 



p(r) = —^e 2 - 2 J 0 — s- 



rA 



r > 0 A > 0 



( 6 . 22 ) 



where 7o() = Modified Bessel function of first kind and zero-order. 

We can characterize the Rician distribution by a parameter K — 
A 2 /(2ct 2 ). K is the direct signal power to multipath variance ratio. K is 
also called the Rician factor. We can also express K in dB as 

ff(dB) = 101og 10 (^) (6.23) 

In the case where A = 0, the Rician distribution reverts to a Rayleigh 
distribution. A plot of the Rician distribution for three K values is shown 
in Fig. 6.11. 



Example 6.4 For the Rician-fading channel where a = 3 mV, the direct path 
amplitude A = 5 mV, what is the probability that the received voltage enve- 
lope will exceed a threshold of 5 mV? 



Solution The probability of the envelope as exceeding 5 mV is given by 

f 00 r <r 2 +A 2 ) / rA\ 

P(r > .005) = / — ^ I 0 ( — ) dr — .627 

J . 005 \ a ) 

This can be shown as the shaded area under the curve in Fig. 6.12. 
As compared to Example 6.2 with no direct path, Example 6.4 demon- 
strates that the likelihood of having a detectable signal dramatically 
increases when the direct path is present. 

Motion in a fast fading channel. In the above examples of line-of-sight 
(LOS) and non-line-of- sight (NLOS) propagation, it was assumed that 
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Figure 6.11 Rician distribution. 



there was no motion in the channel. Motion will change the channel 
behavior by changing the location of the transmitter or receiver. In 
addition, motion introduces many discrete Doppler shifts in the received 
signal. Figure 6.13 shows a moving transmitter in a multipath environ- 
ment with no direct path. 




Figure 6.12 Rician probability density with indicated threshold. 



More Books Visit : www.iqbalkalmati.blogspot.com 



Propagation Channel Characteristics 137 




As the vehicle moves at a constant velocity in Fig. 6.13, many factors 
change with time. The angles (()„) of each multipath signal are time 
dependent. Each multipath experiences a different Doppler shift 
because the angle of scattering with respect to the moving vehicle is 
different for each scattering object. Also, the overall phase shift (a„) 
changes with time because the propagation delays are changing. 

The maximum possible Doppler shift is given by 

fd=fo~ (6-24) 

c 



where fa — Doppler frequency 
fo — carrier frequency 
v — vehicle velocity 
c = speed of light 

Since the direction of vehicle travel is at an angle 9 n with the nth mul- 
tipath, the Doppler shift is modified accordingly. The Doppler shift for 
the ?ith path is given by 

fn = fd COS 6 n = f 0 ~ COS Q n (6.25) 

c 

We can now rewrite Eq. (6.14) accounting for the Doppler frequency 
shifts f n . 

N N 

v r = ^2 a « cos(2 Txfnt + </>„) cos(<w 0 t) - ^ o„ sin(2 jrfnt + (fn) sin(®oD 

n=l n = 1 

N 

= ^ a„ cos(2tt fd cos(9 n )t + 0„)cos(oj o D 

77=1 

N 

- ^2 a n sin(2 Ttfd cos (9 n )t + <j>n ) sin(oj 0 7) 



(6.26) 
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We now have three random variables a n , <p n , and O n . The amplitude 
coefficients are Gaussian distributed whereas the phase coefficients are 
presumed to have a uniform distribution such that 0 < </>„ and 6 n < 2u. 
The envelope of v r again has a Rayleigh distribution. The envelope r is 
given by 



r = \J X 2 + Y 2 (6.27) 

where X = YJl=i a n cos(2 nfd cos (0 n )t + <p n ) and 
Y = J2n= i a n sin(2 nfd cos (6 n )t + </>„) 

This model is called the Clarke flat fading model [7, 14]. 

Example 6.5 Use MATLAB to plot the envelope in Eq. (6.27) where the carrier 
frequency is 2 GHz, the vehicle velocity is 50 mph, the phase angles 4> n and 
6 n are uniformly distributed, the coefficient a n has a Gaussian distribution 
with zero mean and standard deviation of a = .001. Let N = 10. 

Solution We must first convert the velocity to meters/second. Therefore, 
50 mph = 22.35 m/s. Thus, the maximum Doppler shift is 

fd = 2- 10 9 - = 149 Hz 

' 3 ■ 10 8 

Using the following short MATLAB program, we can plot the results as 
demonstrate in Fig. 6.14. 




Figure 6.14 Doppler fading channel with N = 10 Paths. 
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% Fast fading with velocity Example 6.5 



N = 10; 

a= . 001*randn (N, 1 ) ; 
th=rand (N, 1 ) *2 *pi ; 
ph=rand (N, 1 ) *2 *pi ; 
fd=149 ; 
tmax = 10 /fd; 



% number of scatterers 

% create Gaussian amplitude coefficients 
% create uniform phase angles 

% Doppler 
% Maximum time 



omega=2*pi*fd; 

t= [ 0 : 1000 ] *tmax/1000 ; % generate timeline 
X= [zeros (1, length (t) ) ] ; 

Y= [zeros (1, length (t) ) ] ; 

for n=l:N % generate the sums for X and Y 

X=X+a (n) *cos (omega*cos ( th (n) ) *t+ph (n) ) ; 

Y=Y+a (n) *sin (omega*cos ( th (n) ) *t+ph (n) ) ; 
end 

r=sqrt (X. ^2+Y. ^2) ; % Calculate the Rayleigh envelope 
rdb=20*logl0 (r) ; % Calculate the envelope in dB 

figure ; 

plot ( t*1000 , rdb, 1 k 1 ) % plot 
xlabel ( ' time (ms) 1 ) 
ylabel ( 1 envelope 1 ) 
axis ( [0 65 -30 10] ) 



Example 6.5 is representative of Doppler fading but makes assump- 
tions that are not realistic. In the example, the scattering from objects 
is angularly dependent. Thus, the coefficients a„ will be a function of 
time. Additionally, the phase angles </>„ and 0 n change with time. The 
Clarke model can be modified to reflect the time dependence of a n , </>„, 
and 9 n . 

If we assume a large number of paths, the uniform distribution of 
angles 0 n results in a sinusoidal variation in the Doppler frequencies 
/„ . This transformation of the random variable results in a Doppler 
power spectrum derived by Cans [15] and Jakes [16]. 

S d ( f) = , g2 I f\ < fd (6.28) 

7 rfd \l 1 ~{i) 

where a 2 — Yln= l = average power of signal. 

A plot of the Doppler power spectrum is shown in Fig. 6.15. 

As a test of the validity of Eq. (6.28) we can rerun the program of 
Example 6.5 by increasing the scatterers to 100 and padding the ampli- 
tude X with zeros. Again, the maximum Doppler shift is 149 Hz. After 
taking the fast Fourier transform, we can plot the results as seen in 
Fig. 6.16. The spectrum bears a similarity to the theoretical spectrum. 
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Figure 6.15 Doppler power density spectrum. 



However, the difference lies in the fact that the theoretical spectrum 
assumes that the number of scatterers is large enough to apply the 
Central Limit Theorem. In that case, there is a true Gaussian distri- 
bution on the scattering amplitudes and a true uniform distribution on 
the angles. 




Figure 6.16 Doppler power density spectrum. 
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6.3.3 Channel impulse response 

If we assume that the radio channel can be modeled as a linear filter 
then the characteristics of the channel can be modeled by finding the 
impulse response of the channel. Thus, all signal responses to the chan- 
nel are dictated by the impulse response. The impulse response also 
gives an indication of the nature and number of the multiple paths. If it 
is assumed that the channel characteristics can change with time (i.e., 
the mobile user is moving), the channel impulse response will also be 
function of time. The generalized channel impulse response is given as 
follows: 



N 

h c (t, r) = y 'a n (t)e j ' l ' n(t) S(t - r n (t)) (6.29) 

n= 1 

where a n (t) = time varying amplitude of path n 

%lr n (t) = time varying phase of path n which can include the 
effects of Doppler 

t n {t) = time varying delay of path n 

An example of an impulse response magnitude, \h c (t, r)|, is shown 
plotted in Fig. 6.17. The amplitudes generally diminish with increasing 
delay because the spherical spreading increases in proportion to the 
delay. Differences in reflection coefficients can cause an exception to 
this rule for similar path lengths because slightly more distant sources 
of scattering might have larger reflection coefficients. The discrete time- 
delayed impulses are sometimes called fingers, taps, or returns. 

If we additionally assume that the channel is wide sense stationary 
(WSS) over small-scale times and distances, the impulse response will 




Figure 6.17 Channel impulse response at four instants in time. 
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remain stable for short time/fast fading. Thus, we can further simplify 
the impulse response to be approximated as 

N 

h c (r) = y ^a n e j ' l ' n 8(T - z n ) (6.30) 

71=1 



6.3.4 Power delay profile 

In the case of small-scale fading, a channel metric can be defined to 
aid in understanding channel behavior. This metric is called the power 
delay profile (PDP). (The PDP is essentially identical to the multipath 
intensity profile (MIP) [17, 18]). The PDP can be defined as ([8, 9, 19]) 

N 

P{ r) = E[\h c {t, r) | 2 ] = ^ P n 8(r - r n ) (6.31) 

71=1 



where P n = (|a„(7)| 2 ) = a 2 

r n = (t„(7)> 

(x) = estimate of the random value x 

A typical power delay profile for an urban area is shown in Fig. 6.18 
where the fingers are the result of scattering from buildings. 

The PDP is partly defined by the trip time delays (r„). The nature 
and characteristics of the delays helps to define the expected channel 
performance. Thus, it is important to define some terms regarding trip 
delays. Several valuable texts define these statistics [3, 5, 6, 8]. 

■ First Arrival Delay (r A ): This is measured as the delay of the earliest 
arriving signal. The earliest arriving signal is either the shortest 
multipath or the direct path if a direct path is present. All other delays 
can be measured relative to r A . Channel analysis can be simplified 
by defining the first arrival delay as zero (i.e., x A — 0). 

■ Excess Delay: This is the additional delay of any received signal rel- 
ative to the first arrival delay r A . Typically, all delays are defined as 
excess delays. 



P(r) 

A 




As 



A 



Figure 6.18 Typical urban power 
delay profile. 
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Figure 6.1 9 Depiction of PDP 
delay statistics. 



■ Maximum Excess Delay ( tm): This is the maximum excess delay 
where the PDP is above a specified threshold. Thus, P{tm) = Pth (dB). 

■ Mean Excess Delay ( xq): The mean value or first moment of all excess 
delays 



to 



J2n= 1 Pn*n 

spN p 

Ljn = 1 r n 



Sn=l 



(6.32) 



where Pt = J2n = l — multipath power gain. 

■ RMS Delay Spread ( a t ): This is the standard deviation for all excess 
delays. 



a x = 





(6.33) 



Figure 6.19 demonstrates the various terms defined earlier. 

Example 6.6 Calculate the multipath power gain (Pt), the mean excess delay 
(tq), and the RMS delay spread (cr T ) for the PDP given in the Fig. 6.20. 




0 1 



3 



5 



Figure 6.20 Power delay profile. 
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Solution First we must convert all powers to a linear scale. Thus, Pi = .1, 
= .32, P 3 = .1, P 4 = .032. The multipath power gain is 

N 

P r = ^ P„ = .552 or - 2.58 dB. 

71=1 

The mean excess delay is 

Eti ,lx 0 + .32 x 1 + .1 x 3 + .032 x 5 1 _ 

r 0 = = = 1.41 us 

° P T .552 h 

The RMS delay spread is 




6.3.5 Prediction of power delay profiles 

It is sometimes informative to model the power delay profiles in order 
to: (1) understand channel behavior, (2) evaluate the performance of 
equalizers, and (3) estimate bit error rate (BER) performance. Numer- 
ous measurements have been performed on indoor and outdoor chan- 
nels and three relatively useful models have been proposed (Chuang 
[20], Feher [6]). The total received power is given by Pt- The three 
models are: 

One-Sided Exponential Profile. This profile seems to most accurately 
describe both indoor and urban channels. 

Pit) = —e~^ x > 0 (6.34) 

of 



Gaussian Profile 



Pt 

V2ncr T 



Ox 



) 2 



P(x) — 



(6.35) 
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Equal Amplitude Tivo-Ray Profile 

P(r) = ^[<5(r) + S(r - 2 a T )] 



(6.36) 



6.3.6 Power angular profile 

The PDP and the path delays are instructive in helping one to und- 
erstand the dispersive characteristics of the channel and to calculate 
the channel bandwidth. The PDP is especially relevant for single-input 
single-output (SISO) channels since one can view the impulse response 
as being for a SISO system. However, when an array is used at the 
receiver, the angles of arrival are of interest as well. Since the array has 
varying gain vs. angle-of-arrival (G(9)), it is instructive to also under- 
stand the statistics of the angles-of-arrival such as angular spread and 
mean angle-of-arrival (AOA). Every channel has angular statistics as 
well as delay statistics. Basic concepts in modeling the AOA have been 
addressed by Rappaport [9], Gans [15], Fulghum, Molnar, Duel-Hallen 
[21, 22], Boujemaa and Marcos [23], and Klein and Mohr [24]. 

This author will define the angular equivalent of the PDP as the 
power angular profile (PAP). Earlier references discuss the concept of a 
power angle density (PAD ) [23] or a power azimuth spectrum (PAS) [22]. 
However, the PAD or the PAS is more analogous to the traditional power 
spectral density (PSD) rather than to the PDP used previously. The con- 
cept of a PAP immediately conveys angular impulse response informa- 
tion which assists in channel characterization. Thus the PAP is given as 

N 

Pie) = Y PnSie - On) (6.37) 

n—1 



Along with the PAP, we can define some metrics as indicators of the 
angular characteristics of the propagation paths. 



■ Maximum Arrival Angle (9m)'- This is the maximum angle relative to 
the boresight ( 63 ) of the receive antenna array. The boresight angle is 
often the broadside direction for a linear array. The maximum angle 
is restricted such that \9m\ — 9b < 180°. 

■ Mean Arrival Angle ( 9o): The mean value or first moment of all arrival 
angles 



do = 



X)n=l Pn.9 n 

Ell Pn 




(6.38) 



where Pt = Eh Pn — multipath power gain. 
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P(d) 




Figure 6.21 Power angular profile. 



■ RMS Angular Spread (a,,): This is the standard deviation for all 
arrival angles 



er<? 




Figure 6.21 depicts a representative PAP. 



(6.39) 



Example 6.7 Calculate the multipath power gain ( Pt ), the mean arrival angle 
(do), and the RMS angular spread (og) for the PAP given in the Fig. 6.22 

Solution First we must convert all powers to a linear scale. Thus, Pi = .1, 
= -32, P 3 = .1, P 4 = .032. The multipath power gain is 

N 

Pt = P n = -552 W 

n= 1 

The mean arrival angle is 

_ Et=l p n°n 1 x (-80) + .32 x (-45) + .1 x (40) + .032 x (60) 

9 ° ~ Pt “ A52 

= -29.86° 




Figure 6.22 Power angular 
profile. 
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The RMS angular spread is 







— 9 



2 

0 



.1 X (-80) 2 + .32 X (-45) 2 + .1 X (40) 2 + .032 x (60) 2 
J552 



- (-29.86) 2 



= 44° 



An alternative approach for defining angular spread is given in Rap- 
paport [9] where the angular spread is not found by defining first and 
second moments but rather the angular spread is determined through 
the use of a Fourier transform. This is akin to the use of the moment 
generating function in stochastic processes. The following description is 
slightly modified from the original Rappaport definition. We must first 
find the complex Fourier transform of the PAR Thus 



r2n 



F k = 



P(0)e~ jke de 



where Fk = k\h complex Fourier coefficient. 
The angular spread is now defined as 



(6.40) 



I Ed 2 

ere = OwidthJ 1 p 2 (6.41) 

where () w idth is the angular width of the PAP. 

If indeed we use the power angular profile defined in Eq. (6.34), we 
can use the sifting property of the delta function and can find the Fourier 
coefficients. 



F k 




N 

~ jke dd = ]T P n e~ jkBn 

n= 1 



(6.42) 



where 



N 

Fo = P » = p r 

11— 1 

N 

F 1 ^j2 p - e ~ j8n 

n=l 



Example 6.8 Repeat Example 6.7 solving for the angular spread using the 
Rappaport method. 
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Solution The total angular width in Example 6.7 is 140°. The value of Fq is 
the same as the total power, thus F (} — Pj = .552. The value of F\, found 
from Eq. (6.39), is 

4 

Fi = Y^ Pne~ j8n = .le- 7 ' 800 + ,32e j45 ° + .le~ j40 ° + .032e~ jeo ° 

l 

= .34 + j. 23 

Substituting into Eq. (6.41) we can find the angular spread thus 

oe = 94.3° 

This angular spread is larger than the angular spread in Example 6.7 by 
almost a factor of 2. 

6.3.7 Prediction of angular spread 

There are numerous potential models to describe angular spread under 
various conditions. An excellent overview is given by Ertel et al. [25]. 
Additionally, derivations are given for a ring of scatterers and a disk 
of scatterers in Pedersen, Mogensen, and Fleury [26]. Several models 
are given next showing the angular distribution for a ring of scatterers, 
disk of scatterers, and an indoor distribution of scatterers. 

Ring of Scatterers: Figure 6.23 shows the transmitter and receiver 
where the transmitter is circled by a ring of scatterers uniformly 
distributed about the ring. 

If we assume that the scattering occurs from a ring of scatterers, at 
constant radius, surrounding the transmitting antenna, then it can 
be shown that the PAP can be modeled as 

P{6) = — Pt (6.43) 

Ttsje^-e 2 

where 0m is the maximum arrival angle and Pt the total power in all 
angular paths. 

A plot of this distribution is shown in Fig. 6.24 for the example 
where Pt — n and 9m — 45°. 
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Disk of Scatterers: Figure 6.25 shows the transmitter and receiver 
where the transmitter is enveloped by a uniform disk of scatterers. 

The disk diameter is chosen to encompass the circle of influence. 
That is, the region whose radius encompasses the significant scatter- 
ers that create the largest multipath signals. Since the scatterers are 
uniformly distributed within the disk, the PAP is given by 



P(0) = ^-^0l-02 (6.44) 

jt6 m 

A plot of this distribution is shown in Fig. 6.26 for the example where 
P T = 7r and 0 M = 45°. 



Indoor Distribution: Figure 6.27 shows a typical transmitter and 
receiver indoors. 

Extensive field tests have demonstrated that both indoor scattering 
and congested urban scattering can most closely be modeled by a 




PAP 
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Laplace distribution [22, 26, 27]. Thus, the PAP can be modeled as 

Pt — I I 

P(0) = —j= — e I I (6.45) 

72 a e 

A plot of this distribution is shown in Fig. 6.28 for the example where 
Pt = 7t and a$ = 30°. 

The exponential distribution is intuitive because the largest 
received signal is the direct path or line-of-sight signal at 0°. Since the 
largest reflection coefficients are for near grazing angles, the smallest 
angles of arrival represent signals reflecting at grazing angles from 
the closest structures to the line-of-sight. As the angles further in- 
crease, the additional paths are more the result of diffraction and less 
the result of reflection. Diffraction coefficients are generally much 
smaller than reflection coefficients because diffraction produces an 
additional signal spreading. Wider arrival angles tend to represent 
higher order scattering mechanisms. 




Figure 6.27 Indoor propagation. 
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Arrival angle 

Figure 6.28 PAP for indoor scattering. 



6.3.8 Power delay-angular profile 

The next logical extension to the PDP and the PAP is the power delay- 
angular profile (PDAP). This can be viewed as the power profile 
resulting from the extended tap-delay-line method developed by Klein 
and Mohr [24], one that was further explored by Liberti and Rappaport 
[28] . This concept also can be derived from the work of Spencer et al. 
[27], Thus, the PDAP is 



N 

P(t, 9) = P n S( r - Tn)8(6 - 9 n ) (6.46) 

n= 1 

We can combine the excess delays of Example 6.7 along with the arrival 
angles of Example 6.8 to generate a three-dimensional plot of the PDAP 
as shown in Fig. 6.29. 

The extent of correlation between the time delays and angles of 
arrival will depend on the scattering obstacles in the channel. If most 
scatterers lie between the transmitter and the receiver, the correlation 
will be high. If the transmitter and receiver are surrounded by scatter- 
ers there may be little or no correlation. 

6.3.9 Channel dispersion 

From an electromagnetics perspective, dispersion normally occurs 
when, in a medium, the propagation velocities are frequency dependent. 
Thus, the higher frequencies of a transmitted signal will propagate at 
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Figure 6.29 Power delay-angular profile. 

different velocities than the lower frequencies. The high and low fre- 
quency components will produce different propagation delays even with 
no multipath present. This results in signal degradation at the receiver. 
However, a received signal can also be degraded by virtue of the fact that 
the time delays of the multipath components can become a reasonable 
fraction of the symbol period. Thus, the received signal is effectively de- 
graded by time-delayed versions of itself. As the delay spread increases, 
the excess delays increase causing time dispersion. Figure 6.30 displays 
a direct path Gaussian pulse and the received signal for three increas- 
ing delay spreads (eri, a-i, and < 73 ) . All signals are normalized. It is 
clear that the original pulse “broadens” as the delay spread increases. 
If the delay spread increases, this corresponds to a narrower channel 
bandwidth. 

Since a channel can cause time-dispersion, it is necessary to define a 
channel bandwidth Be, also referred to as a coherence bandwidth. This 
will help give an indication as to whether the channel bandwidth is 
sufficient to allow a signal to be transmitted with minimal dispersion. 
The channel bandwidth is described in Shankar [3] and Stein [29] and 
is approximately defined by 



where er T is the delay spread. 

Therefore if the signal chip rate or bandwidth ( Bs ) is less than the 
channel bandwidth (Be), the channel undergoes flat fading. If the signal 




(6.47) 
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Excess delay («sec) 
Figure 6.30 Dispersion caused by delay spread. 



bandwidth is greater than the channel bandwidth, the channel will 
undergo frequency-selective fading. In this case dispersion occurs. 

6.3.10 Slow fading modeling 

Figure 6.5 demonstrates the trend of slow fading (also known as shadow 
fading). This represents the average, about which fast fading occurs. 
Instead of a single scattering mechanism, the transmitted signal can 
reflect, refract, diffract, and scatter multiple times before arriving at 
the receiver. Figure 6.31 demonstrates some of the candidate multipath 
mechanisms in the slow fading case along with coefficients representing 
each scattering location. It is assumed that no line-of-sight path exists. 

The received signal can be represented as a sum of all multipath 
terms. 

N 

V r {t) = J2 a ne j * nt 

71 — 1 



(6.48) 
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Figure 6.31 Slow fading with multiple scattering mechanisms. 

The coefficients a n represent the cascade product of each reflection 
or diffraction coefficient along path n. Thus, we can write a separate 
expression for the amplitude coefficients as 



where b mn — Rayleigh distributed random variables 
M — number of scatterers along path n. 

These multiple scattering events will affect the mean value of the 
received power. As was discussed earlier, the total power received (mul- 
tipath power gain) is the sum of the square of the coefficients a n . How- 
ever, the coefficients a n are the consequence of the products of the bmn. 
The logarithm of the power is the sum of these random variables. Using 
the Central Limit Theorem, this becomes a Gaussian (normal) distribu- 
tion. Hence the name log-normal. Several references on the log-normal 
distribution are Shankar [3], Agrawal and Zheng [4], Saunders [8], 
Rappaport[9], Lee [30], and Suzuki [31]. The pdf of the power in dBm 
is given as 



where P — power p in dBm = 10 ■ log 10 (p) 

P 0 = average signal level in dBm 
ct — standard deviation in dBm 

Using a transformation of the random variable P, we can also express 
the pdf in terms of linear power p. 




(6.49) 




(6.50) 




(6.51) 
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where p — power in mW 

p 0 = average received signal level in mW 

logioto 
°° ^10 

A typical plot of the log-normal distribution given in Eq. (6.51) is shown 
in Fig. 6.32. 



6.4 Improving Signal Quality 

One of the drawbacks of the typical multipath channel is the fact that 
the signal quality is degraded by Doppler spread and dispersion. The 
negative effects of dispersion were briefly discussed in Sec. 6.3.9. If 
the signal bandwidth is greater than the channel bandwidth, we have 
frequency-selective fading. The consequence is an increasing intersym- 
bol interference (ISI) leading to an unacceptable BER performance. 
Thus data transmission rates are limited by the delay spread of the 
channel. Since we understand the nature of dispersion and Doppler 
spreading, we can devise methods to compensate in order to improve 
signal quality at the receiver. An excellent discussion of compensation 
techniques is found in Rappaport [9]. The three basic approaches to 
compensate for dispersion and Doppler spread are equalization, diver- 
sity, and channel coding. 
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Received 

signal 



Channel impulse response Equalizer impulse response 



Figure 6.33 Communications system using an adaptive equalizer. 



6.4.1 Equalization 

Channel equalization is the act of reducing amplitude, frequency, time, 
and phase distortion created by a channel. The goal of the equalizer is 
to correct for the frequency selective fading effects in a channel where 
the signal bandwidth (Bs) exceeds the channel coherence bandwidth 
(Be). The equalizer can be a signal processing algorithm that seeks 
to minimize the ISI. Unless the channel characteristics are fixed with 
time, the equalizer must be an adaptive equalizer which compensates as 
the channel characteristics change with time. The ultimate goal of the 
equalizer is to completely neutralize the negative effects of the channel. 
Figure 6.33 demonstrates the ideal equalizer. The goal of the equalizer 
impulse response is to negate the channel impulse response such that 
the receive signal is nearly identical to the transmitted signal. 

The ideal equalizer frequency response which negates the channel 
influence would be given as 

If the channel frequency response can be characterized by H c (f) = 
\H C ( /')|e J 'Uf ) then the equalizer frequency response would be given 
by H eq ( f ) = such that the product of the two filters is unity. 

The channel frequency response and equalizer response are shown in 
Figure 6.34. 

Example 6.9 If the channel impulse response is given by h c (t) = .2S(t) + .35 
(i t — r), find the channel frequency response and the equalizer frequency 
response. Superimpose plots of the magnitude of H c (f) and H eq (f) for 0 < 
fx < 1.5. 



Solution The channel frequency response is the Fourier transform of the chan- 
nel impulse response. Thus 



/ OO 

(.2 5(f) + .35 (t - x ))e~ j2nft dt 

-OO 

= .2 + .2,e~ j2nfz 



The equalizer frequency response is given as 



H eq (f) = 



1 

H*(-f) 



1 

.2 + .3 e~j 2n f r 



.2 + .3 ej 2n B 
.13 + .12 cos(2nfx) 
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Figure 6.34 Channel and equalizer frequency response. 



Since the channel characteristics are changing with time, it is neces- 
sary to design an adaptive equalizer that is able to respond to changing 
channel conditions. Figure 6.35 demonstrates a block diagram of an 
adaptive equalizer. The error signal is used as feedback to adjust the 
filter weights until the error is minimized. 

The drawback of adaptive algorithms is that the algorithm must go 
through a training phase before it can track the received signal. The 
details of adaptive equalization are beyond the scope of this text but 
can be further explored in the Rappaport reference given previously. 
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Figure 6.35 Adaptive equalizer. 
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6.4.2 Diversity 

A second approach for mitigating the effects of fading is through 
diversity. Diversity means that a plurality of information is transmit- 
ted or received allowing one to reduce the depth and/or the duration of 
fades. This plurality can be achieved by having multiple receive antenna 
elements (antenna space diversity), different polarizations (polariza- 
tion diversity), different transmit frequencies (frequency diversity), or 
different time characteristics such as Code division multiple access 
CDMA (time diversity). Thus, one can choose, amongst a collection of 
received signals, which signal provides the least amount of fading. This 
can be achieved by choosing perhaps the optimum antenna element, the 
optimum polarization, the optimum carrier frequency, or the optimum 
time diversity signal. Diversity has an advantage in that no adaptation 
or training is required in order to optimize the receiver. 

Space diversity can be implemented through one of four basic 
approaches: selection diversity (select the largest signal from the 
antenna outputs), feedback diversity (scan all antenna outputs to find 
the first with a sufficient SNR for detection), maximal ratio combining 
(weight, co-phase, and sum all antenna outputs), and equal-gain com- 
bining (co-phase all received signals and combine with unity weights). 

Polarization diversity is normally implemented through orthogonal 
polarizations. Orthogonal polarizations are uncorrelated and one can 
receive both polarizations with a dual polarized receive antenna. In 
addition, left and right hand circular polarizations can be used. One 
can select the polarization that maximizes the received signal-to-noise 
ratio. 

Frequency diversity can be implemented by using multiple carrier fre- 
quencies. Presumably if a deep fade occurs at one frequency, the fade 
may be less pronounced at another frequency. It is suggested that the 
frequency separation be at least the channel coherence bandwidth (Be) 
in order to ensure that the received signals are uncorrelated. Thus, 
the collection of possible transmit frequencies can be f 0 ± nB c (n — 
0 , 1 , 2 ...). 

Time diversity can be implemented by transmitting the same infor- 
mation multiple times where the time delay exceeds the coherence 
time ( T c — 1 /Be). Thus, the times of transmission can be Tq + tiT c 
(n = 0, 1, 2 . . .). Time diversity can also be implemented by modulating 
the transmission with CDMA pseudo-random codes. Typically these pn 
codes are uncorrelated if the different signal paths cause time delays 
exceeding a chip width. A RAKE receiver is a possible implementation 
of a time diversity scheme. 

RAKE receiver. A channelized correlation receiver can be used where 
each channel attempts a correlation with the received signal based 
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Input 




Figure 6.36 Structure of a RAKE receiver. 



upon the anticipated path delays. One channel is assigned for each of 
the anticipated M strongest received components. The time delay r m is 
associated with the anticipated delay of the m th path. The correlating 
CDMA waveform is given by <p. Each channel can also have a weight 
W n to allow the receiver to produce maximal ratio combining or equal- 
gain combining. Figure 6.36 demonstrates such a channelized receiver 
called a RAKE receiver. The RAKE receiver received its name based 
upon its similarity to a garden rake (Price and Green [32]). 

As an example of a two channel RAKE receiver, we can transmit a 
32-chip CDMA waveform of length T = 2 ps. Let there only be two paths 
to the receiver. The excess delay for path 1 is 0 ps and the excess delay 
for path 2 is 71.6 ns or 1.3 times the chip width (r c hi p = T/ 32 = 62.5 ns). 
The received waveforms are shown in Fig. 6.37. The correlation of the 
first waveform with itself is greater than four times the correlation with 
the delayed copy of itself. Thus, the first channel has a high correlation 
with the first arriving waveform and the second channel has a high 
correlation with the second arriving waveform. The first channel excess 
delay = 0 ps. The second channel excess delay t‘i — 71.6 ns. 

6.4.3 Channel coding 

Because of the adverse fading effects of the channel, digital data can 
be corrupted at the receiver. Channel coding deliberately introduces 
redundancies into the data to allow for correction of the errors caused by 
dispersion. These redundant symbols can be used to detect errors and/or 
correct the errors in the corrupted data. Thus, the channel codes can 
fall into two types: error detection codes and error correction codes. The 
error detection codes are called automatic repeat request (ARQ) codes. 
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Waveform with 0 excess delay 




time jUsec 

Waveform with 1.3r excess delay 




time jUsec 

Figure 6.37 Channel 1 and 2 waveforms. 



The error correction codes are forward error correcting (FEC) codes. 
The combination of both is called a hybrid-ARQ code. Two basic types of 
codes can be used to accomplish error detection and/or error correction. 
They are block codes and convolutional codes. Turbo codes are a type of 
convolutional code. An excellent description of these codes and channel 
coding schemes is given by Rappaport [9], Sklar [10], Proakis [19], and 
Parsons [33]. 

6.4.4 MIMO 

MIMO stands for multiple-input multiple-output communication. This 
is the condition under which the transmit and receive antennas have 
multiple antenna elements. This is also referred to as volume-to-volume 
or as a multiple-transmit multiple-receive (MTMR) communications 
link. MIMO has applications in broadband wireless, WLAN, 3G, and 
other related systems. MIMO stands in contrast to single-input single- 
output (SISO) systems. Since MIMO involves multiple antennas, it can 
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M - antenna elements 



Af-antenna elements 



Figure 6.38 MIMO system. 



be viewed as a space diversity approach to channel fading mitigation. 
Excellent sources for MIMO systems information can be found in the 
IEEE Journal on Selected Areas in Communications: MIMO Systems 
and Applications Parts 1 and 2 [34, 35], Vucetic and Yuan [36], Dig- 
gavi et al. [37], and Haykin and Moher [38]. A basic MIMO system is 
illustrated in Fig. 6.38 where H is the complex channel matrix relating 
M inputs to N outputs, s is the complex transmit vector, and x is the 
complex receive vector. 

The goal of MIMO is to combine signals on both the transmit and 
receive ends such that the data rate is increased and/or the ISI and BER 
are decreased. As was mentioned previously, one diversity option for a 
SISO system is time diversity. MIMO allows the user to combine time 
diversity and space diversity. It can be said that MIMO is more than a 
space diversity option but is a space-time signal processing solution to 
channel fading. Multipath propagation has been viewed as a nuisance 
in SISO systems but it is actually an advantage for MIMO systems 
because the multipath information can be combined to produce a better 
received signal. Thus the goal of MIMO is not to mitigate channel fading 
but rather to take advantage of the fading process. 

We may follow the development in Haykin and Moher [38] in order to 
understand the mathematical description of the MIMO system. Under 
flat-fading conditions, let us define the M-element complex transmit 
vector, created by the M-element transmit array, as 

S = [.Si S2 S3 ... sm\ t (6.53) 

The elements s m are phased according to the array configuration and 
are assumed to be coded waveforms in the form of a data stream. If we 
assume that the transmit vector elements have zero mean and variance 
cr 2 , the total transmit power is given by 



P t — M ■ er s 2 



(6.54) 
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Each transmit antenna element, m, connects to a path (or paths) to the 
receive elements, n, creating a channel transfer function, li nm . There are 
consequently N ■ M channel transfer functions connecting the transmit 
and receive array elements. Thus, we can define the N x M complex 
channel matrix as 



/ill 


h\2 ■ 


■ ■ h\M 




/*21 


/*22 • 


' ■ h‘2M 


(6.55) 


_hm 


h-m ■ 


■ ■ hjS[M_ 





Let us now define the Ai-element complex receive vector, created by the 
Ai-element receive array, as 

x — [x\ X2 X 3 ... x^] T (6.56) 

Let us also define the IV-element complex channel noise vector as 

n=[n 1 n 2 ji 3 ... n N ] T (6.57) 

We can now describe the receive vector in matrix form as 

x — H ■ s + h (6.58) 

Assuming a Gaussian distribution for the transmitted signal, the 
channel, the received signal, and the noise, one can easily estimate the 
MIMO channel capacity. The correlation matrix for the transmitted 
signal is given by 

R s = E[s ■ s ff ] 

= o^I M (6.59) 



where ct s 2 = signal variance 

I M = M x M identity matrix. 

Here I m is the M x M identity matrix and s H denotes the Hermitian 
transpose of s. The correlation matrix for the noise is given by 

R n — E[h ■ h H ] 

= cr 2 /jv (6.60) 



where er 2 = noise variance 

I N = N x N identity matrix. 

The MIMO channel capacity is a random variable. Since the chan- 
nel itself is random, we can define the ergodic (mean) capacity. If we 
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assume that the sources are uncorrelated and of equal power, the ergodic 
capacity is given as [39-41] 

bits/s/Hz (6.61) 

where, the expectation is over the random channel matrix H and 
det = determinant 
Cep — equal power capacity 
p — SNR at each receive antenna = P l 

One possible algorithm for use in a MIMO system is the V-BLAST 
algorithm developed at Bell Labs [42]. The V-BLAST algorithm is an 
improvement over its predecessor D-BLAST. (V-BLAST stands for 
Vertically layered blocking structure, Bell Laboratories Layered Space- 
Time). The V-BLAST algorithm demultiplexes a single data stream into 
M substreams which undergo a bit-to-symbol mapping. The mapped 
substreams are subsequently transmitted from the M transmit anten- 
nas. Thus, the total channel bandwidth used is a fraction of the orig- 
inal data stream bandwidth allowing for flat fading. The detection at 
the receiver can be performed by conventional adaptive beamforming. 
Each substream is considered to be the desired signal and all other 
substreams are deemed as interference and are therefore nulled. Thus, 
M simultaneous beams must be formed by the N-element receive array 
while nulling the unwanted substreams. The received substreams may 
then be multiplexed to recover the intended transmission. 

A further exploration of MIMO concepts can be pursued in references 
[34-42], 
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Problems 

6.1 Use the flat earth model. Assume the following values: |i?| = .5, .7, i// = 0, 
Eo/r\ = L hi = 5 m, h 2 = 20 m, d = 100 m. 

(a) What are the arrival time delays tj and t 2 ? 

(h) Plot a family of curves for the envelope and the phase vs. (2khih 2 )/d 

6.2 Use the flat earth model. Assume the following values: |i?| = .5, \jj — 0, 
hi — 5 m, h 2 = 20 m, d = 100 m. A baseband rectangular pulse is transmitted 
whose width is 10 ns. The transmit pulse is shown in Fig. P6.1. The received 
voltage magnitude for the direct path Vj = IV and is proportional to E 0 /ri . 

(a) What are the arrival time delays ti and t 2 ? 

(b) Estimate the received voltage magnitude V 2 for the reflected path us- 
ing spherical spreading and | R\ . 

(c) Plot the received pulses for the direct and indirect paths showing pulse 
magnitudes and time delays. 

6.3 Let us model an urban environment by using MATLAB to generate 100 
values of the amplitude a n , the phase a n , and the distance r„ . Use Eqs. (6.13), 
(6.14), and (6.15). Generate values for the amplitudes, a n , using randn and 
assume that o = 1 mV. Use rand to generate phase values for ot n to vary between 
0 and 2n. Use rand to define r n = 500 + 50* rand(l,N). Find the phase </>„. Plot 
the envelope r (mV) for 100 kHz < f < 500 kHz. Let the x axis be linear and 
the y axis be log 10. 



Vo 



0 10 ns 



Figure P6.1 Transmit pulse. 
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Figure P6.2 Power delay profile. 



6.4 Calculate the outage probability for the Rayleigh process where a = 2 mV 
and the threshold pth = 3 pW. 

6.5 Plot the probability density function for the Rician distribution where the 
Rician factor K = — 5 dB, 5 dB, and 15 dB. Superimpose all curves on the same 
plot. Assume that a = 2 mV. 

6.6 Repeat Prob. 3 for the Rician channel where A= 4a. 

6.7 For the Rician-fading channel where a = 2 mV, the direct path amplitude 
A = 4 mV, what is the probability that the received voltage envelope will exceed 
a threshold of 6 mV? 

6.8 Use MATLAB to plot the envelope in Eq. (6.27) where the carrier frequency 
is 2 GHz, the vehicle velocity is 30 mph, the phase angles 4> n , 9 n are uniformly 
distributed, the coefficient a n has a Gaussian distribution with zero mean and 
standard deviation of a = 0.002. Let N = 5. 

6.9 Calculate the multipath power gain Pf, the mean excess delay r 0 , and the 
RMS delay spread a T for the PDP given in Fig. P6.2. 

6.10 Calculate the multipath power gain Pt, the mean excess delay r () , and 
the RMS delay spread a T for the PDP given in Fig. P6.3. 
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Figure P6.3 Power delay profile. 
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Figure P6.4 Power angular pro- 
file. 
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6.11 Calculate the multipath power gain Pt, the mean arrival angle 9q, and 
the RMS angular spread ag for the PAP given in Fig. P6.4. 

6.12 Calculate the multipath power gain Pt, the mean arrival angle 9q, and 
the RMS angular spread erg for the PAP given in Fig. P6.5. 

6.13 For P(9) in Prob. 11, use the Rapport method to calculate the angular 
spread. 

6.14 For P(9) in Prob. 12, use the Rapport method to calculate the angular 
spread. 

6.15 For a transmitter transmitting 1MW of power located at the rectangular 
coordinates (0, 250) and the receiver is located at (1km, 250). Four scatterers 
are located at: (200, 400), (300, 150), (500, 100), (600, 400), and (800, 100). 
Each isotropic scatterer has a reflection coefficient of .7. Account for spherical 
spreading such that the power is inversely proportional to r 2 . 



(a) Derive the time delays and powers associated with each path and 
plot the PDP 

( b ) Derive the angles of arrival and powers associated with each path 
and plot the PAP 

(c) Plot the two dimensional power delay angular profile given in 
Eq. (6.46) 



P(9) 



0 dB 



3 dB 



6 dB 



9 dB 



Figure P6.5 Power angular pro- 
file. 
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6.1 6 For a ring of scatterers where the circle radius a = 20 m and the distance 
between the transmit and receive antennas is 100 m. Allow Py = 1 Watt 

(а) Solve for the power angular profile 

(б) Plot the PAP 

6.17 For a disk of scatterers where the circle radius a = 20 m and the distance 
between the transmit and receive antennas is 100 m. Allow Py = 1 Watt 

(а) Solve for the power angular profile 

(б) Plot the PAP 

6.18 If the channel impulse response is given by h c (t) = .2<5U) + .4S(f — r) + 
.6S(f — 2 t) 

(а) Derive and plot the channel frequency response for .5 < fz < 1.5 

(б) Derive and plot the equalizer frequency response for .5 < fx < 1.5 
(c) Are these plots conjugate inverses? 

6.19 If the channel impulse response is given by h c (t) = .6<5 it) + AS(t — 2z) + 
.2«(f-4r) 

(а) Derive and plot the channel frequency response for .5 < fx < 1.5 

(б) Derive and plot the equalizer frequency response for .5 < fr < 1.5 
(c) Are these plots conjugate inverses? 
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Chapter 

7 

Angle-of-Arrival Estimation 



In the propagation channel topics discussed in Chap. 6, it was appar- 
ent that even for one source there are many possible propagation paths 
and angles of arrival. If several transmitters are operating simultane- 
ously, each source potentially creates many multipath components at 
the receiver. Therefore, it is important for a receive array to be able to 
estimate the angles of arrival in order to decipher which emitters are 
present and what are their possible angular locations. This information 
can be used to eliminate or combine signals for greater fidelity, suppress 
interferers, or both. 

Angle-of-arrival (AOA) estimation has also been known as spectral es- 
timation, direction of arrival (DOA) estimation, or bearing estimation. 
Some of the earliest references refer to spectral estimation as the abil- 
ity to select various frequency components out of a collection of signals. 
This concept was expanded to include frequency-wavenumber prob- 
lems and subsequently AOA estimation. Bearing estimation is a term 
more commonly used in the sonar community and is AOA estimation 
for acoustic problems. Much of the state-of-the-art in AOA estimation 
has its roots in time series analysis, spectrum analysis, periodograms, 
eigenstructure methods, parametric methods, linear prediction meth- 
ods, beamforming, array processing, and adaptive array methods. Some 
of the more useful materials include a survey paper by Godara [1], 
spectrum analysis by Capon [2], a review of spectral estimation by 
Johnson [3] , an exhaustive text by Van Trees [4] and a text by Stoica 
and Moses [5]. 

7.1 Fundamentals of Matrix Algebra 

Before beginning our development of AOA (spectral) estimation meth- 
ods, it is important to review some matrix algebra basics. We will denote 
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all vectors as lower case with a bar. An example is the array vector a. 
We will denote all matrices as upper case also with a bar such as A. 

7.1.1 Vector basics 

Column Vector: The vector a can be denoted as a column vector or as 
a row vector. If a is a column vector or a single column matrix, it can 
be described as 



cti 

a-2 

a-M 



(7.1) 



Row Vectors: If 5 is a row vector or a single row matrix, it can be 
described as 



b — [b\ b- 2 ■■■ b N \ 



(7.2) 



Vector Transpose: Any column vector can be changed into a row vec- 
tor or any row vector can be changed into a column vector by the 
transpose operation such that 



a T — [ai a 2 • • • um\ 

r&ii 







(7.3) 

(7.4) 



Vector Hermitian Transpose: The Hermitian transpose is the conju- 
gate transpose of a vector denoted by the operator H. Thus the Her- 
mitian transpose 1 of a above can be demonstrated as 



a H =[a\ a| ••• a* M \ (7.5) 

R 1 




1 The Hermitian transpose is also designated by the symbol f 
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Vector Dot Product (Inner Product): The dot product of row vector 
with itself is traditionally given by 

bi 

62 

b-b T = [b 1 b 2 ■■■ b N ] ■ . 

pN 

= b\ + bl + --- + b 2 N (7.7) 



Vandermonde Vector: A Vandermonde vector is an M-element vector 
such that 




(7.8) 



Thus, the array steering vector of Eq. (4.8) is a Vandermonde vector. 



7.1.2 Matrix basics 

A matrix is an M x N collection of elements such that 

an 012 • • • auv 
_ 021 a 22 • • • 02 N 

A — . . (7.9) 

OjWl OM2 • • • clmn_ 

where Mx Vis the size or order of the matrix. 

Matrix Determinant The determinant of a square matrix can be 
defined by the Laplace expansion and is given by 

an 012 • • • a\M 

_ 021 022 • • ■ a M 

\A\ = : . . . 

OMl OM2 • • • ClMM 

M 

— ^ a : jCof(a LJ ) for any row index i 
j= 1 

M 

= ^2 aijcofiaij ) for any column index j 
i = 1 



(7.10) 
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where, cofiaij) is the cofactor of the element a,j and is defined by 
cof(aij) = (—l) l+ 'Mjj and M :j is the minor of a,;, . The minor is 
determinant of the matrix left after striking the ith row and the jth 
column. If any two rows or two columns of a matrix are identical, 
the determinant is zero. The determinant operation in MATLAB is 
performed by the command det(A). 



Example 7.1 Find the determinant of A 



12 0 
3 2 1 

5 1-1 



Solution Using the first row indices |A| = l aijcofiay). Thus 



|A| = 1 • (2 ■ (-1) - 1 • 1) - 2 • (3 • (-1) - 5 • 1) + 0 • (3 • (-1) -5-1) 
= 13 



The same result may be found by using the two MATLAB commands: 

» A = [1 2 0;3 2 1;5 1 -1] ; 

» det (A) 
ans = 13 

Matrix Addition Matrices can be added or subtracted by simply 
adding or subtracting the same elements of each. Thus 



C — A ± B =>■ cij — atj ± bij (7.11) 

Matrix Multiplication Matrix multiplication can occur if the column 
index of the first equals the row index of the second. Thus anMxlV 
matrix can be multiplied by an N x L matrix yielding an M x L 
matrix. The multiplication is such that 

N 

C = A - B =>■ cij — 'Y2 a ihbkj (7.12) 

k=i 

Example 7.2 Multiply the two matrices A = 



B = 



7 3 
-1 5 



Solution 



AB = 



9 

17 



-7' 

29 
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This also may be accomplished in MATLAB with the following com- 
mands: 

» A = [1 -2; 3 4] ; 

» B = [7 3 ; - 1 5 ] ; 

» A*B 



ans = 



9 -7 

17 29 



Identity Matrix The identity matrix, denoted by I, is defined as an 
M x M matrix with ones along the diagonal and zeros for all other 
matrix elements such that 



1 0 • • • O' 

0 1 ••• 0 



0 0 ••• 1 



( 7 . 13 ) 



The product of an identity matrix I with any square matrix A yields 
A such that I ■ A — A - 1 — A. The identity matrix can be created in 
MATLAB with the command eye(M). This produces an MxM identity 
matrix. 

Cartesian Basis Vectors The columns of the identity matrix I, are 
called Cartesian basis vectors. The Cartesian basis vectors are 
denoted by Ui,U 2 ,...,Um such that Ui — [1 0 ••• 0] T ,u,2 — 

[0 1 • ■ • 0 ] T , . . . , um — [0 0 • • ■ 1 ] T . Thus, the identity matrix 

can be defined as I — [di U 2 • • • um\ ■ 

Trace of a Matrix The trace of a square matrix is the sum of the 
diagonal elements such that 



N 

Tr(A) = Y J a H ( 7 . 14 ) 

i = 1 



The trace of a matrix in MATLAB is found by the command trace (A). 

Matrix Transpose The transpose of a matrix is the interchange of 
the rows and columns denoted by A T . The transpose of the product 
of two matrices is the product of the transposes in reverse order such 
that (A ■ B) r — B r ■ A T . The transpose in MATLAB is performed with 
the command transpose(A) or A.’. 

Matrix Hermitian Transpose The Hermitian transpose is the trans- 
pose of the conjugate (or the conjugate transpose) of the matrix 
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elements denoted by A H . The Hermitian transpose in MATLAB is 
performed by the operation ctranspose(A) or A’. The determinant 
of a matrix is the same as the determinant of its transpose thus 
\A\ = \A T \. The determinant of a matrix is the conjugate of its Her- 
mitian transpose thus |A| = \A H \*. The Hermitian transpose of the 
product of two matrices is the product of the Hermitian transposes 
in reverse order such that (A • B) H — B H ■ A H . This is an important 
property which will be used later in the text. 

Inverse of a Matrix The inverse of a matrix is defined such that 
A • A = I where A is the inverse of A. The matrix A has an 
inverse provided that |A| A 0. We define the cofactor matrix such 
that C — cof(A) = [( — l) t+ 7 j Ayj] and A (/ is the remaining matrix after 
striking row i and column j. The inverse of a matrix in MATLAB is 
given by inv(A). Mathematically, the matrix inverse is given by 



A 



1 



C T 

W\ 



(7.15) 



Example 7.3 Find the inverse of the matrix A = y 2 g J 

Solution First find the cofactor matrix C and the determinant of A. 



C = 



5 

-3 



2 

1 



|A| = 11 



The inverse of A is then given as 



.4545 -.2727' 

.1818 .0909 

This problem can also be easily solved in MATLAB using the following 
commands: 

» A = [1 3; -2 5] ; 

>> inv(A) 




ans 



.4545 -.2727 

.1818 .0909 



Eigenvalues and eigenvectors of a matrix. The German word eigen 
means appropriate or peculiar. Thus, the eigenvalues and eigenvectors 
of a matrix are the appropriate or peculiar values that satisfy a homo- 
geneous condition. The values of X, that are eigenvalues of the N x N 
square matrix A, must satisfy the following condition 



\XI — A\ = 0 



(7.16) 
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The determinant above is called the characteristic determinant produc- 
ing a polynomial of order IV in X having N roots. In other words 

\xT— A\ = (X - M)U - X 2 ) • • • (X - X N ) (7.17) 

Each of the eigenvalues On, X 2 , ...,X N ) satisfies Eq. (7.16). We may 
now also define eigenvectors associated with the matrix A. If the JVx N 
matrix A has N unique eigenvalues Xj, it has N eigenvectors ej satis- 
fying the following homogeneous equation: 

(XjT- A)e j = 0 (7.18) 

In MATLAB, one can find the eigenvectors and eigenvalues of a ma- 
trix A by the command [EV, V] = eig(A). The eigenvectors are the 
columns of the matrix EV and the corresponding eigenvalues are the 
diagonal elements of the matrix V. The MATLAB command diag(V) 
creates a vector of the eigenvalues along the diagonal of V. 

Example 7.4 Use MATLAB to find the eigenvectors and eigenvalues of the 
matrix [J §] . 

Solution The following lines create the matrix and compute the eigenvectors 
and eigenvalues: 

» A= [1 2,-3 5] ; 

» [EV,V] =eig(A) ; 

» EV 

EV = 

-0.8646 -0.3613 

0.5025 -0.9325 

>> diag(V) 

ans = 

-0.1623 

6.1623 

The first column of EV is the first eigenvector and the corresponding eigen- 
value is A-i = —.1623. The second column of EV is the second eigenvector and 
the corresponding eigenvalue is X 2 = 6.1623. 

These simple vector and matrix procedures will assist us in using MATLAB 
to solve for AOA estimation algorithms that will be described in Sec. 7.3. 



7.2 Array Correlation Matrix 

Many of the AOA algorithms rely on the array correlation matrix. In 
order to understand the array correlation matrix, let us begin with a 
description of the array, the received signal, and the additive noise. 
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Figure 7.1 M-element array with arriving signals. 



Figure 7.1 depicts a receive array with incident planewaves from vari- 
ous directions. 

Figure 7.1 shows D signals arriving from D directions. They are 
received by an array of M elements with M potential weights. Each 
received signal x m (k) includes additive, zero mean, Gaussian noise. 
Time is represented by the kth time sample. Thus, the array output 
y can be given in the following form: 

y(k) = w T ■ x{k) (7.19) 



where 



x(k) = [a(0i) a(0 2 ) ■ ■ -aiOo)] ■ 



Sl(k') 

s 2 (k) 

sn(k) 



+ n(k ) (7.20) 



= A ■ s(k) + n(k ) 



and 

w = [w\ w 2 • • ■ wm] T = array weights 

s(k) = vector of incident complex monochromatic signals at time k 
n(k) = noise vector at each array element m, zero mean, variance 
d(Oj ) — M-element array steering vector for the 0, direction of arrival 
A = [d(0i)a(9 2 ) ■ ■ ■ a{Oo)]M x D matrix of steering vectors aOi 

Thus, each of the D-complex signals arrives at angles Oi and is 
intercepted by the M antenna elements. It is initially assumed that 
the arriving signals are monochromatic and the number of arriving 
signals D < M. It is understood that the arriving signals are time vary- 
ing and thus our calculations are based upon time snapshots of the 
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incoming signal. Obviously if the transmitters are moving, the matrix 
of steering vectors is changing with time and the corresponding arrival 
angles are changing. Unless otherwise stated, the time dependence will 
be suppressed in Eqs. (7.19) and (7.20). In order to simplify the notation 
let us define the M x M array correlation matrix R xx as 

R xx = E[x ■ x H ] = E[{As + ri)(s H A H + ii H )] 

= AE[s • s H ]A H + E[il • ?l H ] 

= AR ss A h + R nn (7.21) 

where R ss — D x D source correlation matrix 

R nn = <r*T=M x M noise correlation matrix 
I = N x N identity matrix 

The array correlation matrix R xx and the source correlation matrix 
R ss are found by the expected value of the respective absolute values 
squared (i.e., R xx = E[x ■ x H ] and R ss = E[s ■ s H ]). If we do not know the 
exact statistics for the noise and signals, but we can assume that the 
process is ergodic, then we can approximate the correlation by use of 
a time-averaged correlation. In that case the correlation matrices are 
defined by 

1 K i K i k 

R xx ~ jz'^ j x{k)x H {k) Rss ~ — ^2s(k)s H (k) R nn ~ — y^n(k)n H (k) 

k=l A k=l k=\ 

Wlien the signals are un correlated, R ss , obviously has to be a diag- 
onal matrix because off-diagonal elements have no correlation. When 
the signals are partly correlated, R ss is nonsingular. When the signals 
are coherent, R ss becomes singular because the rows are linear combi- 
nations of each other [5] . The matrix of steering vectors, A, is an M x D 
matrix where all columns are different. Their structure is Vandermonde 
and hence the columns are independent [6, 7]. Often in the literature, 
the array correlation matrix is referred to as the covariance matrix. 
This is only true if the mean values of the signals and noise are zero. In 
that case, the covariance and the correlation matrices are identical. The 
arriving signal mean value must necessarily be zero because antennas 
cannot receive d.c. signals. The noise inherent in the receiver may or 
may not have zero mean depending on the source of the receiver noise. 

There is much useful information to be discovered in the eigenanal- 
ysis of the array correlation matrix. Details of the eigenstructure are 
described in Godara [1] and are repeated here. Given M- array elements 
with D-narrowband signal sources and uncorrelated noise we can make 
some assumptions about the properties of the correlation matrix. First, 
R xx is an M x M Hermitian matrix. A Hermitian matrix is equal to its 




More Books Visit : www.iqbalkalmati.blogspot.com 



178 Chapter Seven 

complex conjugate transpose such that R xx — R xx . The array correla- 
tion matrix has M eigenvalues (/.], >. 2 , • • ■ , A m ) along with M associated 
eigenvectors E — [eie 2 . . . If the eigenvalues are sorted from small- 

est to largest, we can divide the matrix E into two subspaces such that 
E — [En Es\. The first subspace En is called the noise subspace and is 
composed of M — D eigenvectors associated with the noise. For uncor- 
related noise, the eigenvalues are given as M = X 2 = ■ ■ ■ = Xm-d = 

The second subspace Es is called the signal subspace and is composed of 
D eigenvectors associated with the arriving signals. The noise subspace 
is an M x (M — D) matrix. The signal subspace is an M x D matrix. 

The goal of AOA estimation techniques is to define a function that 
gives an indication of the angles of arrival based upon maxima vs. an- 
gle. This function is traditionally called the pseudospectrum P(9) and 
the units can be in energy or in watts (or at times energy or watts 
squared). There are several potential approaches to defining the pseu- 
dospectrum via beamforming, the array correlation matrix, eigenanal- 
ysis, linear prediction, minimum variance, maximum likelihood, min- 
norm, MUSIC, root-MUSIC, and many more approaches that will not 
be addressed in this chapter. Both Stoica and Moses [5] and Van Trees 
[4] give an in-depth explanation of many of these possible approaches. 
We will summarize some of the more popular pseudospectra solutions 
in the next section. 



7.3 AOA Estimation Methods 



7.3.1 Bartlett AOA estimate 

If the array is uniformly weighted, we can define the Bartlett AOA 
estimate [8] as 

P b { 9) = a H {9)R xx a{9) (7.22) 



The Bartlett AOA estimate is the spatial version of an averaged 
periodogram and is a beamforming AOA estimate. Under the condi- 
tions where s represents uncorrelated monochromatic signals and there 
is no system noise, Eq. (7.22) is equivalent to the following long-hand 
expression: 



Pb(9) = 



D M 

EE g j ( m— l)kd( sin 0 — sin 9i ) 

i=l rrt= 1 



(7.23) 



The periodogram is thus equivalent to the spatial finite Fourier trans- 
form of all arriving signals. This is also equivalent to adding all beam- 
steered array factors for each angle of arrival and finding the absolute 
value squared. 
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Example 7.5 Use MATLAB to plot the pseudo-spectrum using the Bartlett 
estimate for an M = 6 element array. With element spacing d — a/2, uncor- 
related, equal amplitude sources, (si, S 2 ), and 0 % — .1, and the two different 
pairs of arrival angles given by ±10° and ±5°, assume ergodicity. 

Solution From the information given we can find the following: 

d(60=[l e jnsine ■■■ e j5nsiue ] T 

A = [did!) a(9 2 )] R ss = 

Applying Eq. (7.21) we can find R xx for both sets of angles. Substituting R xx 
into Eq. (7.22 ) and using MATLAB, we can plot the pseudospectrum as shown 
in Figs. 7.2a and b. 

Recalling the half-power beamwidth of a linear array from Chap. 4, 
Eq. (4.21), we can estimate the beamwidth of this M = 6 element array 
to be ~8.5°. Thus, the two sources, which are 20° apart are resolvable 
with the Bartlett approach. The two sources, which are 10° apart are not 
resolvable. Herein lies one of the limitations of the Bartlett approach to 
AOA estimation: the ability to resolve angles is limited by the array half- 
power beamwidth. An increase in resolution requires a larger array. For 
large array lengths with d = X/ 2 spacing, the AOA resolution is approx- 
imately 1 /M. Thus, 1 /M is the AOA resolution limit of a periodogram 
and in the case above is an indicator of the resolution of the Bartlett 
method. It should be noted that when two emitters are separated by an 
angle wider than the array resolution, they can be resolved but a bias 
is introduced. This bias cause the peaks to deviate from the true AOA. 
This bias asymptotically decreases as the array length increases. 





7.3.2 Capon AOA estimate 

The Capon AOA estimate [2, 4] is known as a minimum variance dis- 
tortionless response (MVDR). It is also alternatively a maximum likeli- 
hood estimate of the power arriving from one direction while all other 
sources are considered as interference. Thus the goal is to maximize the 
signal-to-intereference ratio (SIR) while passing the signal of interest 
undistorted in phase and amplitude. The source correlation matrix R ss 
is assumed to be diagonal. This maximized SIR is accomplished with a 
set of array weights {w — [w\W2 • • • wm\ t ) as shown in Fig. 7.1, where 
the array weights are given by 



w — 



R xx d 

d H {Q)R xx d{0) 



(7.24) 



where R xx is the unweighted array correlation matrix. 
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Figure 7.2 (a) Bartlett pseudospectrum for 9i = —10°, 0 2 = 10°. 
(6) Bartlett pseudospectrum for 8i = —5°, 0 2 = 5°. 



Substituting the weights of Eq. (7.24) into the array of Fig. 7.1, we 
can then find that the pseudospectrum is given by 

Pc(0) = * (7-25) 

d H (6)R xx d(d) 



Example 7.6 Use MATLAB to plot the pseudo-spectrum using the Capon esti- 
mate for anM= 6 element array. With element spacing d = A/2, uncorrelated, 
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equal amplitude sources, (si, s 2 ), and cr^ = .1, and the pair of arrival angles 
given by ±5°, assume ergodicity. 

Solution We can use the same array correlation matrix as was found in 
Example 7.5. Using MATLAB we produce Fig. 7.3. 

It is clear that the Capon AOA estimate has much greater resolution 
than the Bartlett AOA estimate. In the case where the competing sources 
are highly correlated, the Capon resolution can actually become worse. The 
derivation of the Capon (ML) weights was conditioned upon considering that 
all other sources are interferers. If the multiple signals can be considered 
as multipath signals, with Rayleigh amplitude and uniform phase, then the 
uncorrelated condition is met and the Capon estimate will work. 

The advantage of the Bartlett and Capon estimation methods is that these 
are nonparametric solutions and one does not need an a priori knowledge of 
the specific statistical properties. 

7.3.3 Linear prediction AOA estimate 

The goal of the linear prediction method is to minimize the prediction 
error between the output of the rath sensor and the actual output [3, 9]. 
Our goal is to find the weights that minimize the mean-squared pre- 
diction error. In a similar vein as Eq. (7.24), the solution for the array 
weights is given as 



w„ 



Rxx U >n 

T/. U'm 



(7.26) 
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where u m is the Cartesian basis vector which is the mth column of the 
M x M identity matrix. 

Upon substitution of these array weights into the calculation of the 
pseudo-spectrum, it can be shown that 



PlpJO) = 




(7.27) 



The particular choice for which /nth element output for prediction 
is random. Although the choice made can dramatically affect the final 
resolution. If the array center element is chosen, the linear combina- 
tion of the remaining sensor elements might provide a better estimate 
because the other array elements are spaced about the phase center of 
the array [3]. This would suggest that odd array lengths might provide 
better results than even arrays because the center element is precisely 
at the array phase center. 

This linear prediction technique is sometimes referred to as an 
autoregressive method [4]. It has been argued that the spectral peaks 
using linear prediction are proportional to the square of the signal 
power [3]. This is true in Example 7.7. 



Example 7.7 Use MATLAB to plot the pseudo-spectrum using the linear pre- 
dictive estimate for an M = 6 element array. With element spacing cl = a /2, 
uncorrelated, equal amplitude sources, (si, S 2 ), and a % — .1, and the pair of 
arrival angles given by ±5°, choose the 3rd element of the array as the refer- 
ence element such that the Cartesian basis vector is 113 = [0 0 1 0 0 0 1 7 . 
Assume ergodicity. 

Solution The pseudospectra is given as Plp 3 (0) = - — 3 _* x 3 2 and is plotted 
in Fig. 7.4. 

It is very obvious that under these conditions, the linear predictive method 
provides superior performance over both the Bartlett estimate and the Capon 
estimate. The efficacy of the performance is dependent on the array element 
chosen and the subsequent u n vector. When one selects the arrival signals 
to have different amplitudes, the linear predictive spectral peaks reflect the 
relative strengths of the incoming signals. Thus, the linear predictive method 
not only provides AOA information but it also provides signal strength infor- 
mation. 



7.3.4 Maximum entropy AOA estimate 

The maximum entropy method is attributed to Burg [10, 11]. A further 
explanation of the maximum entropy approach is given in [1, 12]. The 
goal is to find a pseudospectrum that maximizes the entropy function 
subject to constraints. The details of the Burg derivation can be found 
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in the references discussed previously. The pseudospectrum is given by 



Pmej(O) 



I 

a(9) H CjCj I d(6 ) 



(7.28) 



where Cj is the ,/th column of the inverse array correlation matrix ( R X J ) . 



Example 7.8 Use MATLAB to plot the pseudospectrum using the maximum 
entropy AOA estimate for an M = 6-element array, element spacing d — A/2, 
uncorrelated, equal amplitude sources, (si, S2), and = .1, and the pair of 
arrival angles given by ±5°. Choose the 3rd column (£3) of the array correla- 
tion matrix to satisfy Eq. (7.28). Assume ergodicity. 

Solution The pseudospectrum is given as plotted in Fig. 7.5 

It should be noted that the maximum entropy method, when we select the 
C3 column from R ~ x , gives the same pseudospectra as the linear predictive 
method. The choice of 6j can dramatically effect the resolution achieved. The 
center columns of the inverse array correlation matrix tend to give better 
results under the conditions assumed in this chapter. 



7.3.5 Pisarenko harmonic decomposition 
AOA estimate 

The Pisarenko harmonic decomposition (PHD) AOA estimate is named 
after the Russian mathematician who devised this minimum 
mean-squared error approach [13, 14]. The goal is to minimize the 
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mean-squared error of the array output under the constraint that the 
norm of the weight vector be equal to unity. The eigenvector that mini- 
mizes the mean-squared error corresponds to the smallest eigenvalue. 
For an M — 6-element array, with two arriving signals, there will be 
two eigenvectors associated with the signal and four eigenvectors as- 
sociated with the noise. The corresponding PHD pseudospectrum is 
given by 



Pphd(O) = -k ( 7 . 29 ) 

|a ff (d)ei|“ 

where e\ is the eigenvector associated with the smallest eigenvalue . 

Example 7.9 Use MATLAB to plot the pseudo-spectrum using the Pisarenko 
Harmonic Decomposition estimate for an M = 6 element array, element 
spacing d = A./2, uncorrelated, equal amplitude sources, (si, S 2 ), and 0 % = .1, 
and the pair of arrival angles given by ±5°. Choose the first noise eigenvector 
to produce the pseudospectrum 

Solution After finding the array correlation matrix, we can use the eig( ) com- 
mand in MATLAB to find the eigenvectors and corresponding eigenvalues. 
The eigenvalues are given by A* = = ^3 — ^4 = a n = -1> ^5 = 2.95, 

k 6 = 9.25. 
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Figure 7.6 Pisarenko Harmonic Decomposition pseudospectrum for Oi = 
—5°, #2 = 5°. 




The first eigenvector associated with ki is given by 

'—0.143' 

—0.195 

0.065 

e i = 

0.198 

0.612 

-0.723 



Substituting this eigenvector in Eq. (7.29) we can plot Fig. 7.6 

The Pisarenko peaks are not an indication of the signal amplitudes. These 
peaks are the roots of the polynomial in the denominator of Eq. (7.29). It is 
clear that for this example, the Pisarenko solution has the best resolution. 

7.3.6 Min-norm AOA estimate 

The minimum-norm method was developed by Reddi [15] and Kumare- 
san and Tufts [16]. This method is also lucidly explained by Ermolaev 
and Gershman [17]. The min-norm method is only relevant for uniform 
linear arrays (ULA). The min-norm algorithm optimizes the weight vec- 
tor by solving the optimization problem where 

min w H w Egiv = 0 w H u\ = 1 (7.30) 
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where w — array weights 

Es = subspace of D signal eigenvectors = [Hm-d+i cm- d+2 • • • e#] 
M = number of array elements 
D = number of arriving signals 

Mi = Cartesian basis vector (first column of the M x M 
identity matrix) 

= [100- • 0] T 



The solution to the optimization yields the min-norm pseudospectrum 



Pmn(O) = 



(u[E n E%u i) 2 
Idie^ENE^l 2 



(7.31) 



where En — subspace of M — D noise eigenvectors = [ei • • • eM-zd 
a(6) — array steering vector 

Since the numerator term in Eq. (7.31) is a constant, we can normalize 
the pseudospectrum such that 



Pmn(O) = _ "2 

\a{Q) H E N E*ui\ 2 



(7.32) 



Example 7.1 0 Use MATLAB to plot the pseudo-spectrum using the min-norm 
AO A estimate for an M = 6 element array, with element spacing d — a/2, 
uncorrelated, equal amplitude sources, (si, S2), and a % — .1, and the pair of 
arrival angles given by ±5°. Use all noise eigenvectors to construct the noise 
subspace En. 

Solution After finding the array correlation matrix, we can use the eig( ) com- 
mand in MATLAB to find the eigenvectors and corresponding eigenvalues. 
The eigenvalues are broken up into two groups. There are eigenvectors as- 
sociated with the noise eigenvalues given by A-i = ^2 — ^3 — ^4 — a n = T. 
There are eigenvectors associated with the signal eigenvalues >.5 = 2.95 and 
Ag = 9.25. The subspace created by the M — D = 4 noise eigenvectors is 
given as 



-0.14 


-0.56 


-0.21 


0.27 ‘ 


-0.2 


0.23 


0.22 


-0.75 


0.065 


0.43 


0.49 


0.58 


0.2 


0.35 


-0.78 


0.035 


0.61 


-0.51 


0.22 


-0.15 


-0.72 


-0.25 


0 


0.083. 



Applying this information to Eq. (7.32), we can plot the angular spectra in 
Fig. 7.7 
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Figure 7.7 Min-norm pseudospectrum for 9\ = —5°, 62 = 5°. 




It should be noted that the pseudospectrum from the min-norm method 
is almost identical to the PHD pseudospectrum. The min-norm method 
combines all noise eigenvectors whereas the PHD method only uses the 
first noise eigenvector. 

7.3.7 MUSIC AOA estimate 

MUSIC is an acronym which stands for Mu ltiple Sig nal Classification. 
This approach was first posed by Schmidt [18] and is a popular high 
resolution eigenstructure method. MUSIC promises to provide unbi- 
ased estimates of the number of signals, the angles of arrival, and 
the strengths of the waveforms. MUSIC makes the assumption that 
the noise in each channel is uncorrelated making the noise correla- 
tion matrix diagonal. The incident signals may be somewhat correlated 
creating a nondiagonal signal correlation matrix. However, under high 
signal correlation the traditional MUSIC algorithm breaks down and 
other methods must be implemented to correct this weakness. These 
methods will be discussed later in this chapter. 

One must know in advance the number of incoming signals or one 
must search the eigenvalues to determine the number of incoming sig- 
nals. If the number of signals is D, the number of signal eigenvalues 
and eigenvectors is D, and the number of noise eigenvalues and eigen- 
vectors is M—D ( M is the number of array elements). Because MUSIC 
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exploits the noise eigenvector subspace, it is sometimes referred to as 
a subspace method. 

As before we calculate the array correlation matrix assuming uncor- 
related noise with equal variances. 

R xx = AR SS A H + oJJ (7.33) 

We next find the eigenvalues and eigenvectors for R xx . We then pro- 
duce D eigenvectors associated with the signals and M—D eigenvectors 
associated with the noise. We choose the eigenvectors associated with 
the smallest eigenvalues. For uncorrelated signals, the smallest eigen- 
values are equal to the variance of the noise. We can then construct the 
M x (M — D) dimensional subspace spanned by the noise eigenvectors 
such that 



En = [e 1 e2 ••• eM-fl] (7.34) 

The noise subspace eigenvectors are orthogonal to the array steer- 
ing vectors at the angles of arrival G\, 02, ■ ■ ■ , 6d- Because of this 
orthogonality condition, one can show that the Euclidean distance d 2 = 
a(d) H EjqE^aiO) — 0 for each and every arrival angle 0\, O2, ■ ■ ■ , On- 
Placing this distance expression in the denominator creates sharp 
peaks at the angles of arrival. The MUSIC pseudospectrum is now 
given as 



Pmu(9) \d(d) H E N E%d(d)\ (7 ' 35) 

Example 7.11 Use MATLAB to plot the pseudospectrum using the MUSIC 
AOA estimate for an M = 6 element array. With element spacing d = a/2, 
uncorrelated, equal amplitude sources, (si, S2), and a % — .1, and the pair of 
arrival angles given by ±5°. Use all noise eigenvectors to construct the noise 
subspace En- 

Solution After finding the array correlation matrix, we can use the eig( ) com- 
mand in MATLAB to find the eigenvectors and corresponding eigenvalues. 
The eigenvalues are given by U = A 2 = ^3 = 0.4 = cr,f = .1, >.5 = 2.95, and 
Xg — 9.25. The eigenvalues and eigenvectors can be sorted in MATLAB from 
the least to the greatest by the following commands: 

[V,Dia] = eig(Rxx); 

[Y, Index] = sort (diag (Dia) ) ; 



EN = V( Index (1 :M-D) ) ; 
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Figure 7.8 MUSIC pseudospectrum for 9i = —5°, 82 = 5°. 




The subspace created by the M — D = 4 noise eigenvectors again is given as 



‘—0.14 


-0.56 


-0.21 


0.27 


-0.2 


0.23 


0.22 


-0.75 


0.065 


0.43 


0.49 


0.58 


0.2 


0.35 


-0.78 


0.035 


0.61 


-0.51 


0.22 


-0.15 


-0.72 


-0.25 


0 


0.083 



Applying this information to Eq. (7.35), we can plot the angular spectra in 
Fig. 7.8 

Under the conditions stated for the Pisarenko harmonic decomposition, 
the min-norm method, and the MUSIC method, the solutions all have simi- 
lar resolution. It should be understood that in all examples discussed earlier, 
it was assumed that the array correlation matrix was of the form given in 
Eq. (7.33), that the noise variance for all elements was identical, and that the 
different signals were completely uncorrelated. In the case where the source 
correlation matrix is not diagonal, or the noise variances vary, the plots can 
change dramatically and the resolution will diminish. 

In the more practical application we must collect several time samples of 
the received signal plus noise, assume ergodicity, and estimate the correlation 
matrices via time averaging. We can repeat Eq. (7.33) without assuming that 
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we know the signal statistics 




R xx — E[x(k) ■ x H (k)\ ' 


1 K 

» k ^2 % (k) ■ x H (k) 
k=i 


~ AR ss A h + AR sn 


+ R,rsA H + R nn 


where 




1 K 

Rss = 

k=i 


1 K 

Rsn = y ^s(k)n H (k) 

k=l 


1 K 


, * 



Rns = ^2 n (k)s H (k) R rm - ^2 n(k)n H (k ) 



(7.36) 



k=i 



k=i 



Example 7.12 Use MATLAB to plot the pseudospectrum using the MUSIC 
AOA estimate for an M = 6 element array. With element spacing d — 
X/2, the pair of arrival angles are given by ±5°. Assume binary Walsh- 
like signals of amplitude 1, but with only K finite signal samples. Assume 
Gaussian distributed noise of cr^ = .1 but with only K finite noise sam- 
ples. Also assume the process is ergodic and collect K = 100 time sam- 
ples {k = 1, 2, . . . , K) of the signal such that s = sign(randn(M, K)) and 
the noise such that n = sqrt(sig2) * randn (M,K) (sig2 = a%). Calculate 
all correlation matrices via time averaging as defined in Eq. (7.36) This 
can be accomplished in MATLAB by the commands Rss = s * s' /K, Rns = 
n * s'/K, Rsn = s * n! /K, and Rnn = n * n! /K. Assume a pair of arrival 
angles given by ±5°. Use all noise eigenvectors to construct the noise sub- 
space En and find the pseudospectrum. ( Important : MATLAB will not order 
the eigenvalues from least to greatest, so one must sort them before select- 
ing the appropriate noise eigenvectors. The sorting method was shown in the 
previous example. The noise subspace is then given by EN = E(:, index 
(1 :M — £»). The MATLAB code for this example demonstrates the sorting 
process). 

Solution We can generate the 100 time samples of the noise and signal as 
indicated earlier. After finding the array correlation matrix R xx , we can use 
the eig( ) command in MATLAB to find the eigenvectors and corresponding 
eigenvalues. The eigenvalues are given by A-i = .08, A. 2 = .09, A. 3 = .12, 
A 4 = .13, 1.5 = 2.97, and Ag = 9. 

Applying this information to Eq. (7.35), we can plot the angular spectra in 
Fig. 7.9. 

It is clear from the last example that the resolution of the MUSIC algorithm 
begins to diminish as we have to estimate the correlation matrices by time 
averages so that we have R xx = AR SS A H + AR sn + R tlJS A H + R nn . 




More Books Visit : www.iqbalkalmati.blogspot.com 



Angle-of-Arrival Estimation 191 




Figure 7.9 MUSIC pseudospectrum using time averages for = —5°, 
82 = 5°. 



7.3.8 Root-MUSIC AOA estimate 

The MUSIC algorithm in general can apply to any arbitrary array 
regardless of the position of the array elements. Root-MUSIC implies 
that the MUSIC algorithm is reduced to finding roots of a polynomial as 
opposed to merely plotting the pseudospectrum or searching for peaks 
in the pseudospectrum. Barabell [12] simplified the MUSIC algorithm 
for the case where the antenna is a ULA. Recalling that the MUSIC 
pseudospectrum is given by 



Pmu(O) — i — — „ y (7.37) 

\d(e) H E N E%d(e)\ 

One can simplify the denominator expression by defining the 
matrix C = E^E 1 ^ which is Hermitian. This leads to the root-MUSIC 
expression 



Prmu(O) = 



1 

d(9) H Cd(9) 



(7.38) 



If we have an ULA, the znth element of the array steering vector is 
given by 



a m (9) = e^ (ra - 1,s in0 



m= 1,2, ...,M 



(7.39) 
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The denominator argument in Eq. (7.38) can be written as 

M M 

a(0) H Ca(0) — EE C ^ Cmn&j 

m= 1 n= 1 
M—l 

= J 2 c e e jkdtsin6 (7.40) 

e=-M + 1 

where c* is the sum of the diagonal elements of C along the fth diagonal 
such that 



Cl 



'y \ '-'inn 

n—m=i 



c,i 



(7.41) 



It should be noted that the matrix C has off-diagonal sums such that 
co > \ci\ for i ^ 0. Thus the sum of off-diagonal elements is always less 
than the sum of the main diagonal elements. In addition, q — c*_ v For 
a 6 x 6 matrix we have 11 diagonals ranging from diagonal numbers 
l — —5, —4, . . . , 0, . . . , 4, 5. The lower left diagonal is represented by 
l — — 5 whereas the upper right diagonal is represented by l = 5. 
The a coefficients are calculated by c_ 5 = Cq\, c_ 4 = C 51 + C 62 , c _3 = 
C 41 + C 52 + C 6 3 , and so on. 

We can simplify Eq. (7.40) to be in the form of a polynomial whose 
coefficients are a, Thus 



m- 1 

D(z) = ^ 

<=-M+l 



(7.42) 



where 2 = e - jkd am e 

The roots of D(z) that lie closest to the unit circle correspond to the 
poles of the MUSIC pseudospectrum. Thus, this technique is called root- 
MUSIC. The polynomial of Eq. (7.42) is of order 2(M — 1) and thus has 
roots of z 1, Z2, ■ ■ ■ , Z2(m-d- Each root can be complex and using polar 
notation can be written as 

Zi = \Zi\e jarsUi) i = 1,2, ...,2(M-1) (7.43) 

where arg(z;) is the phase angle of z L . 

Exact zeros in D(z) exist when the root magnitudes \z t \ = 1. One can 
calculate the AOA by comparing e ja - rg(Zi) to e jk/lK ' nH ‘ to get 



&i — — sin 




arg (zi) 



(7.44) 
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Example 7.13 Repeat Example 7.12 by changing the noise variance to be 
cr% = .3. Change the angles of arrival to 0\ = —4° and 62 = 8°. Let the 
array be reduced to having only M = 4 elements. Approximate the cor- 
relation matrices by time averaging over K — 300 data points. Superim- 
pose the plots of the pseudospectrum and the roots from root-MUSIC and 
compare. 

Solution One can modify the MATLAB program with the new variables so 
that the 4-element array produces a 4 x 4 matrix C as defined previously. 
Thus C is given by 



C = 



.305 

.388 — .0331 
.092 — .0281 
.216 + .0641 



.388 + .0331 
.6862 

.225 — .0191 
.11 -.Oils 



.092 + .0281 
.225 + .0191 
.6732 

.396 - .0461 



.216 - .0641 
-.11 + .0117 
— .396 + .0461 
.335 



The root-MUSIC polynomial coefficients are given by the sums along 
the 2 M — 1 diagonals. Thus 



c = .216 + .0651, -.203 - .0391, -1.01 - .0991, 2.0, 

-1.01 + .0991, -.203 + .039+ .216 - .0651 

We can use the root command in MATLAB to find the roots and then 
solve for the magnitude and angles of the 2(M — 1) = 6 roots. We can 
plot the location of all 6 roots showing which roots are closest to the unit 
circle as shown in Fig. 7. 10. It is clear that only the four on the right side 
of the y axis are nearest to the unit circle and are close to the expected 
angles of arrival. 

We can choose the four roots closest to the unit circle and replot them 
along with the MUSIC pseudospectrum in Fig. 7.11. 

The roots found with root-MUSIC earlier do not exactly reflect the 
actual location of the angles of arrival of 0\ = —4° and 62 = 8° but they do 
indicate two angles of arrival. The roots themselves show the existence 
of an angle of arrival at near 8° which is not obvious from the plot of the 
MUSIC pseudospectrum. The error in locating the correct root locations 
owes to the fact that the incoming signals are partially correlated, that 
we approximated the correlation matrix by time averaging, and that 
the S/N ratio is relatively low. One must exert care in exercising the 
use of root-MUSIC by knowing the assumptions and conditions under 
which the calculations are made. 

It is interesting to note that the polynomial Diz) is a self-reciprocal 
polynomial such that Diz) = D*(z). The roots of the polynomial Diz) 
are in reciprocal pairs meaning that z\ — A, 2:3 — 4, . . . , Z 2 M -3 = 

Z 2 Z A 

1 
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Figure 7.10 All 6 roots in cartesian coordinates. 
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Figure 7.11 MUSIC pseudospectrum and roots found with root- 
MUSIC for 9 \ = —4° and 62 = 8°. 
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Due to the self-reciprocal symmetry of D(z) ([4]), one can factor D(z) 
using the Fejer method such that 



D{z) = p(z)p*(l/z*) 



(7.45) 



Under these conditions it is sufficient to solve for the roots of the p(z) 
polynomial of degree M — 1. The roots of p{z) are on or within the unit 
circle while the roots of p*( 1/z*) or on or without the unit circle. 

One method has been proposed by Ren and Willis [19] to reduce the 
order of the polynomial D(z) and thus reduce the computational burden 
of finding the roots. 

The polynomial rooting method can also be applied to the Capon 
algorithm where we substitute C = R n ] for C = E^E^. However, since 
the accuracy of the Capon estimation algorithm is much less than the 
MUSIC approach, the root finding also suffers a loss in accuracy. 

The same principles applied to root-MUSIC can also be applied to the 
Min-Norm method to create a root-Min-Norm solution. We can repeat 
Eq. (7.32) 

Prmn(0) = — — ~o (7.46) 

\a(9) H Cui\ 

where u\ — Cartesian basis vector (first column of the 
M x M identity matrix) 

_= [10 0 ... 0] T 

C — Ej\-E ! ^ — a M x M Hermitian matrix 
En — subspace of M — D noise eigenvectors 
aid) — array steering vector 



The product of the Cartesian basis vector and the Hermitian matrix 
results in creating a column vector composed of the first row of the 
matrix C. The column vector based upon the first column of C becomes 
c\ — [Cn Ci 2 • •• C\m\ t where the subscript 1 indicates the first column. 
We can substitute this into Eq. (7.46). 



Prmn(O) = 




1 

d(9) H cic^a(0) 



(7.47) 



In a similar fashion to Eq. (7.42), we can create a polynomial from the 
denominator of Eq. (7.47) given by 

M - 1 

D(z) = C f z l (7.48) 

l=-M + 1 

The coefficients c f are again the sums of the 2 M — 1 matrix diagonals 
of cicf. 
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Example 7.14 Apply the root-Music method to the min-norm method where 
a % = .3, 6i = —2°, 6*2 — 4°, and M = 4. Approximate the correlation matrices 
by again time averaging over K — 300 data points as was done in Exam- 
ple 7.12. Superimpose the plots of the Min-Norm pseudospectrum and the 
roots from root-Min-Norm and compare. 

Solution The first column of the C matrix is given by 



ci 



.19 

-.33 + .02i 
— .06 + .041 
.2 - .051 



We can calculate the matrix c i c ^ and find the polynomial coefficients by 
summing along the diagonals. 



c = .04 - .011, -.08 + .021, -.06 - .011, .1937, -.06 



+.011, -.08- .021, .04 +.011 



We can use the root command in MATLAB to find the roots and then solve 
for the magnitude and angles of the 2(M — 1) = 6 roots. We can plot the 
location of all 6 roots showing which roots are closest to the unit circle as 
shown in Fig. 7.12. We can also superimpose the closest roots onto the plot 
of the Min-Norm pseudospectrum as shown in Fig. 7.13. 



The Min-Norm pseudospectrum has much sharper resolution than 
MUSIC but there is not indication of the AOA at —2°. However, the 




Figure 7.12 All 6 roots in cartesian coordinates. 
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Figure 7.13 Min-norm pseudospectrum and roots found with root- 
Min-Norm for 8 i = —2° and 82 = 4°. 
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root-Min-Norm algorithm gives a fair indication of the location of both 
angles of arrival. 

7.3.9 ESPRIT AOA estimate 

ESPRIT stands for Estimation of Signal Parameters via Rotational 
Invariance Techniques and was first proposed by Roy and Kailath [20] 
in 1989. Useful summaries of this technique are given by both Godara 
[1] and Liberti and Rappaport [21]. The goal of the ESPRIT technique 
is to exploit the rotational invariance in the signal subspace which 
is created by two arrays with a translational invariance structure. 
ESPRIT inherently assumes narrowband signals so that one knows 
the translational phase relationship between the multiple arrays to 
be used. As with MUSIC, ESPRIT assumes that there are D < M 
narrow-band sources centered at the center frequency fo. These sig- 
nal sources are assumed to be of a sufficient range so that the incident 
propagating field is approximately planar. The sources can be either 
random or deterministic and the noise is assumed to be random with 
zero-mean. ESPRIT assumes multiple identical arrays called doublets. 
These can be separate arrays or can be composed of subarrays of one 
larger array. It is important that these arrays are displaced transla- 
tionally but not rotationally. An example is shown in Fig. 7.14 where a 
four element linear array is composed of two identical three-element 
subarrays or two doublets. These two subarrays are translationally 
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Array 1 

A 

f 






Array 2 

Figure 7.14 Doublet composed 
of two identical displaced arrays. 



displaced by the distance d. Let us label these arrays as array 1 and 
array 2. 

The signals induced on each of the arrays are given by 



x\Ui) = [ai(di) 01 ( 02 ) • • -ai(0z))] 



Sl(k) 

S2(k) 



+ tli(k) 



Sj)(k) 



— Ai ■ s(k) + n\{k) 



(7.49) 



and 



X2(k) — A 2 ■ S(k) + U2(k) 

— A\ ■ • s(k ) + nAJA) (7.50) 

where O = diag{e jkdsine \e Jkdsin92 , e jhlsin0n } 

= a D x D diagonal unitary matrix with phase 
shifts between the doublets for each AOA 
Ai = Vandermonde matrix of steering vectors 
for subarrays 1 = 1,2 



The complete received signal considering the contributions of both sub- 
arrays is given as 



x(k ) = 



Xl(k) 

X2ik) 



A 1 

Ai • T 



• s(k) + 



ni(k) 
n 2(k) 



(7.51) 



We can now calculate the correlation matrix for either the complete 
array or for the two subarrays. The correlation matrix for the complete 
array is given by 

R xx = E[x ■ x H ] = AR ss A h + a (7.52) 



whereas the correlation matrices for the two subarrays are given by 
.R 11 — E [x 1 • xf] = AR ss A h + a%I (7.53) 
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and 



R 22 =E[x 2 - xf] = A ®R ss <t> H A h + all (7.54) 

Each of the full rank correlation matrices given in Eq. (7.53) and (7.54) 
has a set of eigenvectors corresponding to the D signals present. Creat- 
ing the signal subspace for the two subarrays results in the two matrices 
Ei and E 2 . Creating the signal subspace for the entire array results in 
one signal subspace given by E x . Because of the invariance structure of 
the array, E x can be decomposed into the subspaces E\ and E 2 . 

Both E i and E 2 are M x D matrices whose columns are composed 
of the D eigenvectors corresponding to the largest eigenvalues of Ru 
and R 22 . Since the arrays are translationally related, the subspaces 
of eigenvectors are related by a unique non-singular transformation 
matrix 4' such that 



EiV = E 2 (7.55) 

There must also exist a unique non-singular transformation matrix T 
such that 



Ei = AT (7.56) 

and 

E 2 = AOT (7.57) 

By substituting Eqs. (7.55) and (7.56) into Eq. (7.57) and assuming that 
A is of full-rank, we can derive the relationship 

7W" 1 = <J> (7.58) 

Thus, the eigenvalues of T must be equal to the diagonal elements 
of O such that Xi — e 7Msin01 , x 2 — e 7faisinl92 , . . . , X D — e 7fa9sm ®» and the 
columns of T must be the eigenvectors of 4^. T is a rotation operator 
that maps the signal subspace Ei into the signal subspace E 2 . One is 
now left with the problem of estimating the subspace rotation operator 
4> and consequently finding the eigenvalues of 4h 
If we are restricted to a finite number of measurements and we also 
assume that the subspaces E i and E 2 are equally noisy, we can esti- 
mate the rotation operator 4> using the total least-squares (TLS) cri- 
terion. Details of the TLS criterion can be found in van Huffel and 
Vandewalle [22]. This procedure is outlined as follows, (see Roy and 
Kailath [20]). 
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■ Estimate the array correlation matrices Rn, R 22 from the data 
samples. 

■ Knowing the array correlation matrices for both subarrays, one can 
estimate the total number of sources by the number of large eigen- 
values in either i?n or R 2 2 . 

■ Calculate the signal subspaces E\ and E 2 based upon the signal eigen- 
vectors of R\ 1 and R 22 . For ULA, one can alternatively construct the 
signal subspaces from the entire array signal subspace E x . E x is an 
M x D matrix composed of the signal eigenvectors. E\ can be con- 
structed by selecting the first M/2 + 1 rows ((M + l)/2 + 1 for odd 
arrays) of E x . E 2 can be constructed by selecting the last M/2+ 1 rows 
((M+ l)/2 + 1 for odd arrays) of E x . 

■ Next form a 2 D x 2 D matrix using the signal subspaces such that 



C = 



'Ef 

M. 



[Ei E 2 ] — EcAEq 



(7.59) 



where the matrix Ec is from the eigenvalue decomposition (EVD) of 
C such that Xi > X 2 > • • • > X 2 d and A = diag Ui, X 2 , . . . , X 2 d) 

Partition Ec into four D x D submatrices such that 



Ec — 



En E12 

E 2 i E 2 2 



(7.60) 



Estimate the rotation operator by 



V = E12E22 



(7.61) 



■ Calculate the eigenvalues of 'I', X 2 , . . . , Xd 

■ Now estimate the angles of arrival, given that Xi — \e ja - rg,ki> 



Oi = sin 



/ arg(Aj)\ 
\ kd ) 



i = 1,2, ...,D 



(7.62) 



If so desired, one can estimate the matrix of steering vectors from 
the signal subspace E s and the eigenvectors of d' given by E,\, such that 

A= E s Ey. 



Example 7.15 Use the ESPRIT algorithm to predict the angles of arrival for 
an M = 4-element array where noise variance a % = .1. Approximate the 
correlation matrices by again time averaging over K = 300 data points as 
was done in Example 7.12. The angles of arrival are 9i — —5°, 9 2 = 10°. 
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Solution The signal subspace for the entire ideal ULA array correlation matrix 
is given by 



E x = 



.78 .41 + .11 

.12 -.02 1 .56 

-.22 + .071 .54 -.051 
-.51 + .251 .46 -.11. 



The two subarray signal subspaces can now be found by taking the first 
three rows of E x to define E\ and the last three rows of E x to define E 2 



.78 


.41 + .11 ' 




' .12- .021 


.56 ' 


.12- .021 


.56 


E 2 — 


-.22+ .071 


.54 - .051 


.-.22 + .071 


.54 - .051. 




.-.51 + .251 


.46- .11 . 



Constructing the matrix of signal subspaces we get 



C = 



~Ef 

E 2_ 



[E\ E 2 \ = 



- .67 

.26 - .061 
.2 + .031 
. .39 



.26 + .061 
.78 

-.37- .121 
.78 + .111 



.2 - .031 
-.37 + .121 
.4 

-.32 + .081 



.39 - 

.78- .111 
-.32 - .081 
.82 . 



Performing the eigendecomposition we can construct the matrix Ec such that 



Ec = 



E ii 
E21 



E\2 

E22, 



.31 .8 

.63 -.091 -.16 

-26 -.051 .57 +.11 



-.44 +.11 -.22 +.111- 
.45 -.61 -.031 

.75 .15 -.111 



.66 



.01 + .021 .07 -.171 



.73 



We can now calculate the rotation operator + = —E\ 2 E 22 given the rotation 
matrix 



+ = -Ei 2 E 2 2 



.58 - .0791 .2 - .051 

-.67 + .231 .94 -.111 



Next we can calculate the eigenvalues of ^ and solve for the angles of arrival 
using 



0i 



sin 



f argOiA 

\ kd J 



-4.82° 



#2 



( arga 2 )\ 



V 



) 



9.85° 



kd 
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Problems 

7.1 For the two matrices A = [g B = [g 2] 

(a) What is Trace(A) and Trace(B) 

( b ) Show that (A - B) T = B T ■ A T 
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7.2 For the two matrices A = I" 1 . 2 J] B= V 1 .1 show that (A-B) H — 

— H — H i] Li Ajl 

7.3 For A = [g 

(a) Find the A 1 by hand 

(b) Find the A” 1 using MATLAB 

7.4 For A = [g ?] 

(a) Solve for all eigenvalues by hand using Eq. (7.16) 

( b ) Solve for all eigenvectors by hand using Eq. (7.17) 

(c) Solve for all eigenvectors and eigenvalues using MATLAB 

7.5 Repeat Prob. 4 for A = [ | 5 |] 

1 2 3 

7.6 For A = 4 5 6 

7 8 9 

(a) What are all eigenvalues and eigenvectors using MATLAB? 

(b) Which eigenvalues are associated with which eigenvectors? 

(c) Use Eq. (7.17), and by hand, prove that the first eigenvalue and the 
first eigenvector satisfy this equation. 

7.7 An N = 3-element array exists with element spacing d — A/2. One arriving 
signal is defined as si(&) = .1, .2, .3; (k = 1, 2, 3) arrives at the angle 8\ — 0°. 
The other signal is defined as S2(k) = .3, .4, .5; (k = 1, 2, 3) arrives at the angle 
02 = 30°. The noise has a standard deviation a = .1. Use Eqs. (7.15), (7.16), 
and (7.17). Use MATLAB. 

(а) What are the array steering vectors a(9i), 6,(62) ? 

(б) What is the matrix of steering vectorsA? 

(c) What are the correlation matrices R ss , R nr , , R xx 

( d ) What are the eigenvalues and eigenvectors of R xx 

7.8 Repeat parts c and d of Prob. 7 using estimates on the correlation matrices 
(R ss , Rnn, R xx )- Allow the noise to be defined in MATLAB as n = 0.1*randn(3, 
3);. Thus the noise is Gaussian distributed but we only can work with three 
time samples. Are the eigenvalues and eigenvectors similar to those found in 
Prob. 7. Why or why not? 

7.9 Plot the normalized Bartlett pseudospectrum Pb(9) using MATLAB, for 
the case where M =l,d = A/2, a % = .2, and 9\ = 3°, 62 = —3°. Set the vertical 
scale to be —30 to 5 dB and horizontal scale to be —15° to 15°. Allow the signals 
si and S 2 to be uncorrelated resulting in R ss = [J ^ ■ 

7.10 Plot the normalized Capon pseudospectrum Pc(9) using MATLAB, for 
the case where M = 7,d = A/2, cr% — .2, and 61 = 3°, 82 = —3°. Set the vertical 
scale to be —30 to 5 dB and horizontal scale to be —15° to 15°. Allow the signals 
Si and S 2 to be uncorrelated resulting in R ss = [J ?] ■ 
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7.11 Plot the normalized linear prediction pseudospectrum Ppp^iO ) using 
MATLAB, for the case where M = 7 , d = A/2, of — .2, and 0-f — 3°, 62 = —3°. 
Set the vertical scale to be —30 to 5 dB and horizontal scale to be —15° to 15°. 
Allow the signals si and S 2 to be uncorrelated resulting in R ss = [q J] . 

7.12 Plot the normalized maximum entropy pseudospectrum Pme 4 (#) using 
MATLAB, for the case where M = 7 , d = A/2, of — .2, and % = 3°, 62 = —3°. 
Set the vertical scale to be —30 to 5 dB and horizontal scale to be —15° to 15°. 
Allow the signals si and S 2 to be uncorrelated resulting in R ss = [1 °] . 

7.13 For the Pisarenko harmonic decomposition pseudospectrum Pphd(S) 
where M = 7 , d = A/2, of = .2, and 6\ = 3°, #2 = —3°. Set the vertical scale to 
be —30 to 5 dB and horizontal scale to be —15° to 15°. Allow the signals si and 
S 2 to be uncorrelated resulting in R ss — [J • 

(a) What is the smallest eigenvalue? 

(b) What is the eigenvector associated with that eigenvalue? 

(c) Plot the normalized pseudospectrum using MATLAB 



7.14 For the min-norm decomposition pseudospectrum Pmn(@) where M =1 , 
cL — A/2, of — .2, and 9\ — 3°, #2 = —3°, set the vertical scale to be —30 to 
5 dB and horizontal scale to be -15° to 15°. Allow the signals si and S 2 to be 
uncorrelated resulting in R ss = [J . 

(a) What is the signal subspace Es? 

(b) What is the noise subspace Fjv? 

(c) Plot the normalized pseudospectrum using MATLAB 



7.15 For the MUSIC decomposition pseudospectrum -PmuSIC^) where M = 7, 
d = A/2, of = .2, and = 3°, ^2 = —3°, set the vertical scale to be —30 to 
5 dB and horizontal scale to be —15° to 15°. Allow the signals si and S 2 to be 
uncorrelated resulting in R ss = [J 5] ■ 

(a) What is the signal subspace Es? 

(b) What is the noise subspace En? 

(c) Plot the normalized pseudospectrum using MATLAB 

7.16 Repeat Example 7.12 using M = 7, d = A/2, of — .2, and 61 — 3°, 62 = 
—3°. Set the vertical scale to be —30 to 5 dB and horizontal scale to be —15° 
to 15°. 

7.17 For an M = 3-element array with d — A/2, of — .2, and = 3°, 62 — —3° 
and given that the signals si and S 2 are uncorrelated, apply the root-MUSIC 
method 

(a) What is the matrix C? 

(b) What are the coefficients causing Eq. (7.47)? 

(c) What are the roots z ; using Eq. (7.49)? 

(d) What are the angles Qi using Eq. (7.50)? 
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7.18 For an M = 3-element array with d — a / 2 , = .2, and 6>i = 3°, 82 = —3° 

and given that the signals si and S 2 are uncorrelated, apply the root-Min-Norm 
method 

(a) What is the matrix Cl 

( b ) What is the first column ci? 

(c) What are the coefficients causing Eq. (7.47)? 

(d) What are the roots z; using Eq. (7.49)? 

(e) What are the angles 0* using Eq. (7.50)? 

7.19 For an M = 4-element array with d — a/2, a % = .2, and 9\ = 3°, 82 ~ 
—3° and given that the signals si and S 2 are uncorrelated, approximate the 
correlation matrices by time averaging over K = 300 data points as was done 
in Example 7.12. Apply the ESPRIT method 

(а) What are the correlation matrices i?n, 7?22? 

(б) Find the signal subspaces E\, E 2 I 
(c) What is the matrix Eel 

id) What is the rotation operator = —E^E^l 

(e) What are the eigenvalues of '!? 

if) What are the predicted angles of arrival? 
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Chapter 

8 

Smart Antennas 



8.1 Introduction 

Traditional array antennas, where the main beam is steered to 
directions of interest, are called phased arrays, beamsteered arrays, or 
scanned arrays. The beam is steered via phase shifters and in the past 
these phase shifters were often implemented at RF frequencies. This 
general approach to phase shifting has been referred to as electronic 
beamsteering because of the attempt to change the phase of the current 
directly at each antenna element. 

Modern beamsteered array antennas, where the pattern is shaped 
according to certain optimum criteria, are called smart antennas. Smart 
antennas have alternatively been called digital beamformed (DBF) 
arrays or adaptive arrays (when adaptive algorithms are employed). 
The term smart implies the use of signal processing in order to shape the 
beam pattern according to certain conditions. For an array to be smart 
implies sophistication beyond merely steering the beam to a direction 
of interest. Smart essentially means computer control of the antenna 
performance. Smart antennas hold the promise for improved radar sys- 
tems, improved system capacities with mobile wireless, and improved 
wireless communications through the implementation of space division 
multiple access (SDMA). 

Smart antenna patterns are controlled via algorithms based upon 
certain criteria. These criteria could be maximizing the signal-to- 
interference ratio (SIR), minimizing the variance, minimizing the mean- 
square error (MSE), steering toward a signal of interest, nulling the 
interfering signals, or tracking a moving emitter to name a few. The 
implementation of these algorithms can be performed electronically 
through analog devices but it is generally more easily performed 
using digital signal processing. This requires that the array outputs be 

207 
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digitized through the use of an A/D converter. This digitization can be 
performed at either IF or baseband frequencies. Since an antenna pat- 
tern (or beam) is formed by digital signal processing, this process is often 
referred to as digital beamforming. Figure 8.1 contrasts a traditional 
electronically steered array with a DBF array or smart antenna. 

When the algorithms used are adaptive algorithms, this process is 
referred to as adaptive beamforming. Adaptive beamforming is a sub- 
category under the more general subject of digital beamforming. Digital 
beamforming has been applied to radar systems [1-5], sonar systems 
[6] , and communications systems [7] to name a few. The chief advantage 
of digital beamforming is that phase shifting and array weighting can 
be performed on the digitized data rather than by being implemented in 
hardware. On receive, the beam is formed in the data processing rather 
than literally being forming in space. The digital beamforming method 
cannot be strictly called electronic steering since no effort is made to 
directly shift the phase of the antenna element currents. Rather, the 
phase shifting is computationally performed on the digitized signal. If 
the parameters of operation are changed or the detection criteria are 
modified, the beamforming can be changed by simply changing an al- 
gorithm rather than by replacing hardware. 

Adaptive beamforming is generally the more useful and effective 
beamforming solution because the digital beamformer merely consists 
of an algorithm which dynamically optimizes the array pattern accord- 
ing to the changing electromagnetic environment. 

Conventional array static processing systems are subject to degrada- 
tion by various causes. The array SNR can be severely degraded by the 
presence of unwanted interfering signals, electronic countermeasures, 




Output 

(a) (b) 



Figure 8.1 (o) Analog beamforming, ( b ) Digital beamforming. 
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clutter returns, reverberation returns (in acoustics), or multipath inter- 
ference and fading. An adaptive array system consists of the antenna 
array elements terminated in an adaptive processor which is designed 
to specifically maximize certain criteria. As the emitters move or change, 
the adaptive array updates and compensates iteratively in order to 
track the changing environment. Many current modern radar systems 
still rely on older electronic scanning technologies. Recent efforts are 
being exerted to modify radar systems to include digital beamform- 
ing and adaptive beamforming techniques [4]. While current modern 
mobile base stations tend to use older fixed beam technologies to sat- 
isfy SDMA, they also would benefit from the use of modern adaptive 
methods and thereby increase system capacities [8] . 

This chapter will review the historical development of digital beam- 
forming and adaptive arrays, explain some basic DBF principles, and 
will cover the more popular DBF methods which include adaptive 
methods. 

8.2 The Historical Development 
of Smart Antennas 

“The development of adaptive arrays began in the late 50s and has been 
more than four decades in the making.” The word adaptive array was 
first coined by Van Atta [9], in 1959, to describe a self-phased array. 
Self-phased arrays merely reflected all incident signals back in the di- 
rection of arrival by using clever phasing schemes based upon phase 
conjugation. Because of the redirection of incident planewaves, these 
arrays were called retrodirective arrays (see an in-depth explanation 
of retrodirective arrays in Chap. 4, Sect. 4.9). Self-phased arrays are 
instantaneously adaptive arrays since they essentially reflect the inci- 
dent signal in a similar fashion to the classic corner reflector. 

Phase-locked loop (PLL) systems were incorporated into arrays in 
the 60s in an effort to construct better retrodirective arrays since it was 
assumed that retrodirection was the best approach [10]. PLLs still are 
used in single beam scanning systems [11]. 

Adaptive sidelohe cancellation (SLC) was first proposed by Howells 
[12, 13] in 1959. This technique allowed for interference nulling, thus 
raising the SIR. The Howells SLC was the first adaptive scheme which 
allowed for automatic interference nulling. By maximizing the gener- 
alized signal-to -noise ratio(SNR), Applebaum developed the algorithm 
governing adaptive interference cancellation [14, 15]. His algorithm 
became known as the Howells-Applehaum algorithm. At the same time, 
through the use of least mean squares (LMS), Widrow and others app- 
lied self-training to adaptive arrays [16, 17]. The Howells-Applebaum 
and Widrow algorithms both are steepest-descent/gradient-search 
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methods. The Howells-Applebaum and Widrow algorithms both con- 
verge to the optimum Wiener solution. The convergence of these meth- 
ods is dependent upon the eigenvalue spread [18] such that larger 
spreads require longer convergence times. The convergence time con- 
stant is given by [19] 



1 

2pXi 



( 8 . 1 ) 



Where p = gradient step size 
ki = ith eigenvalue. 

The eigenvalues are derived from the correlation matrix where the 
largest eigenvalues correspond to the strongest signals and the small- 
est eigenvalues correspond to the weakest signals or noise. The larger 
eigenvalue spreads result in longer convergence times. In the case of 
SLC, the weakest interfering signals are cancelled last as can be seen 
from Eq. (8.1). 

Since the convergence of SLC algorithm was slow for large eigen- 
value spreads, Reed, Mallett, and Brennen developed the direct sample 
matrix inversion (SMI) technique in 1974 [20]. 

The next great complementary advance in adaptive arrays came with 
the application of spectral estimation methods to array processing. 
(Many of these spectral estimation methods are discussed at length 
in Chap. 7.) The spectral estimation methods, achieving higher angu- 
lar resolutions, have come to be known as supperresolution algorithms 
[18]. Capon, in 1969, used a maximum likelihood (ML) method to solve 
for the minimum variance distortionless response (MVDR) of an array. 
His solution maximizes the SIR [21]. Additionally, the linear predictive 
method was used to minimize the mean-squared prediction error lead- 
ing to optimum array weights. The array weights are dependent on the 
array correlation matrix and are given in Eq. (7.22) [22]. In 1972, Burg 
applied the maximum entropy method to spectral estimation and his 
technique was soon adapted to array signal processing [23, 24]. In 1973, 
Pisarenko developed the harmonic decomposition technique based upon 
minimizing the MSE under the constraint that the norm of the weight 
vector be equal to unity [25]. The minimum-norm (min -norm) method 
was developed by Reddi [26] in 1979 and Kumaresan and Tufts [27] in 
1983. The min-norm algorithm optimizes the weight vector by solving 
the optimization problem where the weight vector is orthogonal to the 
signal eigenvector subspace. This too is a spectral estimation problem 
applied to array signal processing. The now famous MUSIC algorithm 
was developed by Schmidt in 1986 [28]. MUSIC is a spectral estimation 
based algorithm which exploits the orthogonality of the noise subspace 
with the array correlation matrix. The estimation of signal parameters 
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via rotational iavariance (ESPRIT) technique was first proposed by Roy 
and Kailath [29] in 1989. The goal of ESPRIT is to exploit the rotational 
invariance in the signal subspace which is created by two arrays with 
a translational invariance structure. 

This latter group of adaptive methods can be considered to be part of 
the array superresolution methods allowing the user to achieve higher 
resolutions than that allowed by the beamwidth of the array. The price 
of increased resolution comes at the cost of greater computational 
intensity. 

Let us review some of the basics for fixed and adaptive algorithms. 



8.3 Fixed Weight Beamforming Basics 
8.3.1 Maximum signal-to-interference ratio 

One criterion which can be applied to enhancing the received signal 
and minimizing the interfering signals is based upon maximizing the 
SIR [7, 30]. Before we engage in the rigorous derivation of the SIR 
optimization, let us review a heuristic approach which was used in 
Chap. 4. It is intuitive that if we can cancel all interference by placing 
nulls at their angles of arrival, we will automatically maximize the SIR. 

Repeating the basic development of Chap. 4, let us assume anAf = 
3-element array with one fixed known desired source and two fixed 
undesired interferers. All signals are assumed to operate at the same 
carrier frequency. Let us assume a three-element array with the desired 
signal and interferers as shown in Fig. 8.2. 

The array vector is given by 

a = [ e - jkclsm0 1 e jkd^X ( 8 . 2 ) 





Figure 8.2 Three-element array with desired and interfering signals 
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The, as yet to be determined, array weights for optimization are given by 

w H = [w\ W 2 w 3 ] (8.3) 



Therefore, the general total array output is given as 



y = w H ■ d = Wl e- jkdsin0 +w 2 + w 3 e jkdsiDe 



(8.4) 



The array output for the desired signal will be designated by y s whereas 
the array output for the interfering or undesired signals will be desig- 
nated by yi and y 2 . Since there are three unknown weights, there must 
be three conditions satisfied. 

Condition 1: y s — w H ■ do = wie~ jkdsm9 ° + W 2 + w 3 e-’ kdsin0 ° = 1 

Condition 2: yi = w H ■ 01 = w ie - jk(lsm(h +w 2 + w 3 e jkdsin(>1 = 0 

Condition 3: y 2 = w H ■ d 2 = w ie ~ jkdsin02 + w 2 + w 3 e jkdsm02 = 0 



Condition 1 demands that y s — 1 for the desired signal, thus allowing 
the desired signal to be received without modification. Conditions 2 
and 3 reject the undesired interfering signals. These conditions can be 
recast in matrix form as 



w H ■ A = u\ (8.5) 

where A — [do di d 2 ] = matrix of steering vectors 

di = [1 0 • • • 0] T = Cartesian basis vector 

One can invert the matrix to find the required complex weights w\, u> 2 , 
and wa by using 

w H = u\ ■ A - x (8.6) 

As an example, if the desired signal is arriving from do = 0° while 
di = —45° and 0 2 = 60°, the necessary weights can be calculated to be 



w\ 




bo 

00 

1 

0 

<1 

•X . 


W* 2 


= 


.45 


W l 




.28 4- .07i_ 



The array factor is shown plotted in Fig. 8.3. 

The Cartesian basis vector in Eq. (8.6) indicates that the array 
weights are taken from the first row of A -1 . 

The previous development is predicated on the fact that the desired 
signal and the total of the interfering signals make A an invertible 
square matrix. A must be an N x N matrix with N-array elements 
and ^-arriving signals. In the case where the number of interferers 
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Figure 8.3 Sidelobe cancellation. 



is less than M — 1, Godara [19] has provided an equation which gives 
an estimate of the weights. However, his formulation requires noise 
be added in the system because the matrix inversion will be singular 
otherwise. Using the Godara method we have 

w H = u\- A h (A- A h + (8.8) 

where u[ is the Cartesian basis vector whose length equals the total 
number of sources. 

Example 8.1 For an M = 5-element array with d = A/2, the desired signal 
arriving at 9 = 0° and one interferer arrives at —15° while the other inter- 
ferer arrives at +25°. If the noise variance is cr^ = .001, use the array weight 
estimation found in Eq. (8.8) to find the weights and plot. 

Solution The problem is solved in MATLAB using sa ex8 l.m. The matrix of 
steering vectors is given as 



A = [a 0 ai a 2 ] 



where 

d 0 = [1 1 1 1 1] T 

a n = [g-/ 2jrsinfl n g-fisinfl, 1 e j7isin9 n e j2jr sin0„]T n=12 

Since only three sources are present, iii = [1 0 0 1 7 
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Substituting into Eq. (8.8) 



-.26 + .nr 

.17 + .08) 
.13 

.17- .08) 
.26- .lit 



T 




The plot of the array factor is shown in Fig. 8.4. The advantage of the 
Godara method is that the total number of sources can be less than the 
number of array elements. 

This basic sidelobe canceling scheme works through an intuitive 
application of the array steering vector for the desired signal and inter- 
fering signals. However, by formally maximizing the SIR, we can derive 
the analytic solution for all arbitrary cases. We will closely follow the 
derivation given in both [7] and [30] . 

The general non-adaptive conventional narrowband array is shown 
in Fig. 8.5. 

Figure 8.5 shows one desired signal arriving from the angle do and N 
interferers arriving from angles 0\, , On- The signal and the interfer- 
es are received by an array of M elements with M potential weights. 
Each received signal at element m also includes additive Gaussian 
noise. Time is represented by the Mh time sample. Thus, the weighted 



3 

o. 



o. 



0 . 



0 . 



0. 



0. 



0. 



0. 




0 

e 



90 



Figure 8.4 Array pattern with approximate nulls at —15° and 25°. 
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Figure 8.5 Traditional narrowband array. 



array output y can be given in the following form: 

y(,k) — w H • x{k ) 

where 



x(k ) = aos(k) + [a\ cl-l • • • a/vl • 



= X s (k) + Xi(k) + n(k) 



i i®' 

hik) 

iN(k). 



(8.9) 



+ n{k) (8.10) 



with 

w — [w\ W 2 ■ ■ ■ wm\ T = array weights 
x s (k) — desired signal vector 
Xj(k) = interfering signals vector 
n(k) — zero mean Gaussian noise for each channel 
di — M-element array steering vector for the 0, direction of arrival 
We may rewrite Eq.(8.9) using the expanded notation in Eq. (8.10) 

y(k) = ui H ■ [x s (k) + Xi(k) + n(k)] = w H ■ [x s (k) + u(k)} (8.11) 

where 



u(k ) = Xi(k ) + n(k ) = undesired signal 

It is initially assumed that all arriving signals are monochromatic 
and the total number of arriving signals N + 1 < M. It is understood 
that the arriving signals are time varying and thus our calculations 
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are based upon fe-time snapshots of the incoming signal. Obviously, if 
the emitters are moving, the matrix of steering vectors is changing 
with time and the corresponding arrival angles are changing. Unless 
otherwise stated, the time dependence will be suppressed in Eq. (8.9) 
through (8.11). 

We can calculate the array correlation matrices for both the desired 
signal (Mss) and the undesired signals (R uu )- The literature often calls 
these matrices the array covariance matrices. However, the covariance 
matrix is a mean removed correlation matrix. Since we do not generally 
know the statistical mean of the system noise or the front end detec- 
tor output, it is best to label all R matrices as correlation matrices. If 
the process is ergodic and the time average is utilized, the correlation 
matrices can be defined with the time average notation as R ss and R uu . 

The weighted array output power for the desired signal is given by 

cr s 2 = E \w H ■ x s \ 2 — w H ■ R ss ■ w (8.12) 

where 

R ss = E [x s xf] = signal correlation matrix 
The weighted array output power for the undesired signals is given by 
o* = E[\w H ■ u\ 2 ] =w H ■ R uu - u> (8.13) 

where it can be shown that 

Ruu — Rii + Run (8.14) 

with 

Ru — correlation matrix for interferers 
R nn — correlation matrix for noise 



The (SIR) is defined as the ratio of the desired signal power divided by 
the undesired signal power. 



SIR = = w H ■ R ss ■ w 

o',! w H ■ R uu ■ w 



The SIR can be maximized in Eq. (8.15) by taking the derivative with 
respect to w and setting the result equal to zero. This optimization 
procedure is outlined in Harrington [31]. Rearranging terms, we can 
derive the following relationship 



R ss ■ w — SIR ■ R uu ■ w 



(8.16) 



or 



Ruu Rss • w — SIR ■ W 



(8.17) 
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Equation (8.17) is an eigenvector equation with SIR being the eigen- 
values. The maximum SIR ( SIR max ) is equal to the largest eigenvalue 
/.max for the Hermitian matrix R~J-R SS . The eigenvector associated with 
the largest eigenvalue is the optimum weight vector w 0 pt . Thus 

R uu R ss ■ U>SIR — ^max ^opt — SIR max wsir (8.18) 

Since the correlation matrix is defined as R ss — E [|s| 2 ]bo • a$, we can 
pose the weight vector in terms of the optimum Wiener solution. 

wsir = P ■ R-uu ' “o (8.19) 

where 

E[\s\ 2 ] H f oon) 

P — «0 ' W SIR (8.20) 

OItX max 

Although Eq. (8.19) casts the weight vector in the optimum Wiener 
solution form, the weight vector is already known as the eigenvector 
found in Eq. (8.18). 



Example 8.2 The M = 3-element array with spacing d — ,5\ has a noise 
variance a 2 — .001, a desired received signal arriving at 9q — 30°, and two 
interferers arriving at angles 9\ = —30° and 02 = 45° . Assume that the signal 
and interferer amplitudes are constant. Use MATLAB to calculate SIR max , 
the normalized weights from Eq. (8.18), and plot the resulting pattern. 



Solution Based upon the incident angles of arrival for the desired signal and 
interferers along with the array vector a, we can find the correlation matrices 
of the signal and undesired signals as 





■ i i -r 






Rss — 


—i 1 i 








.-1 -i 1 _ 








2.001 


-.61- .201 


— 1.27 — .96i“ 


Ruu — 


-.61 + .201 


2.001 


— .61 — .20i 




.-1.27 + .961 


— .61 + ,20t 


2.001 



The largest eigenvalue for Eq. (8.18) is given in MATLAB as 
SIRmax = /-max = 679 



The array weights are arbitrarily normalized by the center weight value. 
Thus 



[U.48 + .5(1 



w sir = 



1.48- .51 



The derived pattern is shown plotted in Fig. 8.6. 
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Figure 8.6 Maximum SIR pattern. 

8.3.2 Minimum mean-square error 

An alternative means for optimizing the array weights is found by min- 
imizing the MSE. Figure 8.5 must be modified in such a way as to min- 
imize the error while iterating the array weights. The modified array 
configuration is shown in Fig. 8.7. 




Figure 8.7 MSE adaptive system. 
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The signal d{k) is the reference signal. Preferably the reference signal 
is either identical to the desired signal s(k) or it is highly correlated with 
s{k) and uncorrelated with the interfering signals i„(k). If s(k) is not 
distinctly different from the interfering signals, the minimum mean- 
square technique will not work properly. The signal e(k) is the error 
signal such that 

e(k) — d{k) — w H x(k) (8.21) 

Through some simple algebra, it can be shown that the MSE is given 

by 

\e(k)\ 2 = \d{k)\ 2 — 2 d{k)w H x{k) + w H x(k)x H {k)u> (8.22) 

For the purposes of simplification, we will suppress the time dependence 
notation k. Taking the expected value of both sides and simplifying the 
expression we get 

E [|s| 2 ] — E [|d| 2 ] — 2 w H r + w H R xx w (8.23) 

where the following correlations are defined: 

r — E[d* ■ x] = E[d* ■ (x s + x* + ft)] (8.24) 

R xx = E [ xx H ] = R ss + R uu (8.25) 

R ss = E [x s xf] (8.26) 

Ruu — Rii + Run (8.27) 

The expression in Eq. (8.23) is a quadratic function of the weight 
vector w. This function is sometimes called the performance surface or 
cost function and forms a quadratic surface in M-dimensional space. 
Since the optimum weights provide the minimum MSE, the extremum 
is the minimum of this function. A trivial example is given for a two- 
element array that produces a two-dimensional surface as depicted in 
Fig. 8.8. As the desired angle-of-arrival (AOA) changes with time, the 
quadratic surface minimum changes with time in the weight plane. 

In general, for an arbitrary number of weights, we can find the min- 
imum value by taking the gradient of the MSE with respect to the 
weight vectors and equating it to zero. Thus the Wiener-Hopf equation 
is given as 

V w (E[\e\ 2 ]) = 2R xx u> -2r = 0 (8.28) 

Simple algebra can be applied to Eq. (8.28) to yield the optimum Wiener 
solution given as 



wmse = R xx r 



(8.29) 
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If we allow the reference signal d to be equal to the desired signal 
s, and if s is uncorrelated with all interferers, we may simplify the 
correlation f. Using Eqs. (8.10) and (8.24) the simplified correlation f is 



where 



r = 1? [s * -x] = S • ao 



S—E [|s| 2 ] 



(8.30) 



The optimum weights can then be identified as 

wmse = SR~^do (8.31) 



Example 8.3 The M = 5-element array with spacing d = .5/. has a received 
signal energy S = 1 arriving at 8q — 20°, and two interferers arriving at 
angles 0\ = —20° and 02 = 40°, with noise variance cr% = .001. Use MATLAB 
to calculate the optimum weights and plot the resulting pattern. 

Solution MATLAB code sa ex8 3. m is used. The array vectors for the desired 
signal and the two interferers are given by 



--.55 - .84T 




--.55 + .841' 




'-.62 + .78 f 


4 ^ 

CO 

1 

bo 

CO 




.48 + .881 




J 

4 ^ 

CO 

1 

CD 

O 


1.0 


di = 


1.0 


a 2 = 


1.0 


.48 + .881 




4^ 

CO 

1 

bo 

00 




-.43 + .901 


.-.55 + .841. 




.-.55 - .841. 




.-.62 - .781. 
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Figure 8.9 Minimum MSE pattern for 5-element array. 



The correlation matrix for the array is given by 





3.0 




.51- .901 


-1.71 + 


-d 

Op 


-1.01 


+ 


.221 


-1.02 


- 


.971 




.51 + 


.901 


3.0 


.51- 


.901 


-1.71 


+ 


Cp 


-1.01 


+ 


.221 


Rxx — 


-1.71- 


.781 


.51 + .901 


3.0 




.51 


- 


.901 


-1.71 


+ 


-d 

cp 




-1.01- 


.221 


-1.71- .781 


.51 + 


.901 


3 


0 




.51 


- 


.901 




-1.02 + 


.971 


-1.01- .221 


-1.71- 


.781 


.51 


+ 


.901 


3 


0 





The weights can now be calculated by using Eq. (8.25) to get 



'-. 11 - . 211 ' 
.18 — .08) 



WMSE = 



.21 



.18 + .08) 
-.11 + .211 



Applying the weights to the array vector, we can plot the resulting minimum 
MSE pattern Fig. 8.9. 



8.3.3 Maximum likelihood 

The maximum likelihood (ML) method is predicated on the assumption 
that we have an unknown desired signal x s and that the unwanted sig- 
nal n has a zero mean Gaussian distribution. The goal of this method 
is to define a likelihood function which can give us an estimate on the 
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desired signal. Details of the ML approach are found in excellent treat- 
ments by Van Trees [32, 33]. This fundamental solution falls under the 
general category of estimation theory. Referring to Fig. 8.10 it should 
be noted that no feedback is given to the antenna elements. The input 
signal vector is given by 

x — dos + h = x s + h (8.32) 

The overall distribution is assumed to be Gaussian but the mean is 
controlled by the desired signal x s . The probability density function can 
be described as the joint probability density p(x\x s ). This density can 
be viewed as the likelihood function (Haykin [34] and Monzingo [30]) 
that can be used to estimate the parameter x s . The probability density 
can be described as 

P {X\X S ) = 1 e -((x-aos)^(x-a 0 s)) (8 . 33) 

V 2jt(j n 

where a n — noise standard deviation 

R nn — o%I — noise correlation matrix 

Since the parameter of interest is in the exponent, it is easier to work 
with the negative of the logarithm of the density function. We shall call 
this the log-likelihood function. Thus, we can define the log-likelihood 
function as 



L [x] = — In [p(x|x s )] = C(x — aos) H Rf t ^(x — aos ) (8.34) 



where C — constant 
Rnn = E [fih H ] 

Let us define our estimate of the desired signal, called s, that maxi- 
mizes the log-likelihood function. The maximum of L[x] is found by tak- 
ing the partial derivative with respect to s and setting it equal to zero. 




Figure 8.10 Traditional array. 
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Thus 



3 L [x] 

3s 



= 0 = -2d"R,J L x + 2sd^ Rjd l} 



Solving for s we get 



Thus 



* 

a H R- 
u 0 -‘bin 



-X — W 



a 0 



H - 
ML"' 



W ML — 




(8.35) 



(8.36) 



(8.37) 



Example 8.4 The M — 5-element array with spacing d. — ,5k has a received 
signal arriving at the angle do = 30°, with noise variance = .001. Use 
MATLAB to calculate the optimum weights and plot the resulting pattern. 



Solution Since we are assuming that the noise is zero-mean Gaussian noise, 
the noise correlation matrix is the identity matrix given as 



1 
0 

Rnn = o '$ I 0 
0 
0 

The array steering vector is given by 



0 0 
1 0 
0 1 
0 0 
0 0 

"-1 " 
— It 
1 
It 
-1 

The calculated array weights are given as 



a 0 = 



Rj&o 



WML = 



a; 



"R, 



= -2ao 



-*0 J -mn 

The subsequent array factor plot is shown in Fig 8.11. 



8.3.4 Minimum variance 

The minimum variance solution [7] is sometimes called the minimum 
variance distortionless response (MVDR) [34] or the minimum noise 
variance performance measure [30], The term distortionless is applied 
when it is desired that the received signal is undistorted after the appli- 
cation of the array weights. The goal of the MV method is to minimize 
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Figure 8.11 Maximum likelihood pattern for five element array. 



the array output noise variance. It is assumed that the desired and 
unwanted signals have zero mean. We may again use the array config- 
uration of Fig. 8.10. The weighted array output is given by 

y = iu H x = w H tios + w H u (8.38) 

In order to ensure a distortionless response, we must also add the con- 
straint that 

w H ao = 1 (8.39) 

Applying the constraint to Eq. (8.38), the array output is given as 

y = s + w H u (8.40) 

In addition, if the unwanted signal has zero mean, the expected value 
of the array output is given by 

E\y] = s (8.41) 

We may now calculate the variance of y such that 

cr^y = E[\w h x\ 2 ] = 2?[|s + w H u\ 2 ] 

= w h R uu w (8.42) 



where 
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We can minimize this variance by using the method of Lagrange [35]. 
Since all the array weights are interdependent, we can incorporate the 
constraint in Eq. (8.39) to define a modified performance criterion, or 
cost function, that is a linear combination of the variance and the con- 
straint such that 

J(w) = + A(1 - w H a 0 ) 

w h R uu w -H- s , 0 ,o^ 

— — h A( 1 — w do) (8.43) 

where X is the Lagrange multiplier and J{w) is the cost function. 

The cost function is a quadratic function and can be minimized by 
setting the gradient equal to zero. Thus 

V w J(iv) = R uu wmv ~ Xao — 0 (8.44) 



Solving for the weights, we conclude 

w M v = i-Rfu^o (8.45) 



In order to solve for the Lagrange multiplier (A.) we can substitute 
Eq. (8.39) into Eq. (8.45). Thus 



X = 




(8.46) 



Upon substituting Eq. (8.46) into Eq. (8.45), we arrive at the minimum 
variance optimum weights 



WMV = 



RiJdp 

d^R-J-do 



(8.47) 



It should be noted that the minimum variance solution is identical 
in form to the ML solution. The only difference is that the ML ap- 
proach requires that all unwanted signals combined are zero mean 
and have a Gaussian distribution. However, with the minimum vari- 
ance approach, the unwanted signal can include interferers arriving at 
unwanted angles as well as the noise. Thus, the minimum variance 
solution is more general in its application. 



Example 8.5 The M — 5-element array with spacing d — ,5X has a received 
signal arriving at the angle 9q = 30°, one interferer arriving at —10°, and 
noise with a variance csf = .001. Use MATLAB to calculate the optimum 
weights and plot the resulting pattern. 

Solution It is a simple matter to slightly modify the MATLAB code from 
Example 8.3 to include the interferer at —10°. The MATLAB code used is 




IAF(0)l 



More Books Visit : www.iqbalkalmati.blogspot.com 



226 Chapter Eight 



sa_ex8_5.m. The unwanted signal correlation matrix is given by 



1.0 


.85 + .52) 


.46 + .89) 


-.07 + .99) 


-.57 + .82) 


.85 - .52) 


1.0 


.85 + .52) 


.46+ .89) 


-.07 + .99) 


.46 - .89) 


.85 - .52) 


1.0 


.85 + .52) 


.46 + .89) 


1 

0 
<1 

1 

b 

CD 


.46 - .89) 


.85 - .52) 


1.0 


.85 + .52) 


\ 

bi 

-d 

1 

bo 

to 


\ 

0 

-d 

1 

b 

CD 


05 

GO 

1 

CD 


.85 - .52) 


1.0 



The desired array vector is 

'-1 

— 1) 



a(9 0 ) = 



1 

It 



-1 



Using Eq. (8.40), the calculated minimum variance weights are given as 

'-.19+ .04)' 

.03- .19) 



w = 



.25 



.03 + .19) 
-.19- .04) 



The subsequent array factor plot, with cancellation for interference, is shown 
in Fig 8.12. It should be noted that the minimum variance solution allows 
for canceling interferers and a null has been placed near — 10° . 




Figure 8.12 Minimum variance pattern for 5-element array. 
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8.4 Adaptive Beamforming 

The fixed beamforming approaches, mentioned in Sec. 8.3, which 
included the maximum SIR, the ML method, and the MV method were 
assumed to apply to fixed arrival angle emitters. If the arrival angles 
don’t change with time, the optimum array weights won’t need to be 
adjusted. However, if the desired arrival angles change with time, it is 
necessary to devise an optimization scheme that operates on-the-fly so 
as to keep recalculating the optimum array weights. The receiver sig- 
nal processing algorithm then must allow for the continuous adaptation 
to an ever-changing electromagnetic environment. The adaptive algo- 
rithm takes the fixed beamforming process one step further and allows 
for the calculation of continuously updated weights. The adaptation 
process must satisfy a specified optimization criterion. Several exam- 
ples of popular optimization techniques include LMS, SMI, recursive 
least squares (RLS), the constant modulus algorithm (CMA), conjugate 
gradient, and waveform diverse algorithms. We will discuss and explain 
each of these techniques in the following sections. 

8.4.1 Least mean squares 

The least mean squares algorithm is a gradient based approach. 
Monzingo [30] gives an excellent fundamental treatment of this 
approach. Gradient based algorithms assume an established quadratic 
performance surface such as discussed in Sec. 8.3.2. When the perfor- 
mance surface is a quadratic function of the array weights, the perfor- 
mance surface J( if) is in the shape of an elliptic paraboloid having one 
minimum. One of the best ways to establish the minimum is through 
the use of a gradient method. We can establish the performance sur- 
face (cost function) by again finding the MSE. The error, as indicated 
in Fig. 8.7, is 

s(k') — d{k) — w H (k)x(k) (8.48) 

The squared error is given as 

\s{k)\ 2 — | d(k) — w H {k)x{k) | 2 (8.49) 

Momentarily, we will suppress the time dependence. As calculated in 
Sec. 8.3.2, the cost function is given as 

J(w) = D — 2w H r + w H R xx w (8.50) 



where 



D = E[\d\ 2 ] 
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We may employ the gradient method to locate the minimum of Eq. (8.50). 
Thus 



V w (J(w)) = 2R xx w-2r (8.51) 

The minimum occurs when the gradient is zero. Thus, the solution for 
the weights is the optimum Wiener solution as given by 

w op t — R xx f (8.52) 

The solution in Eq. (8.52) is predicated on our knowledge of all signal 
statistics and thus in our calculation of the correlation matrix. 

In general, we do not know the signal statistics and thus must resort 
to estimating the array correlation matrix (R xx ) and the signal corre- 
lation vector (r) over a range of snapshots or for each instant in time. 
The instantaneous estimates of these values are given as 



R xx (k) ~ xik)x H (k) 


(8.53) 


r(k) ~ d*(k)x(k) 


(8.54) 



We can employ an iterative technique called the method of steepest 
descent to approximate the gradient of the cost function. The direction of 
steepest descent is in the opposite direction as the gradient vector. The 
method of steepest descent can be approximated in terms of the weights 
using the LMS method advocated by Widrow [16, 17]. The steepest 
descent iterative approximation is given as 

w(k + 1) = w(k) — -p'Vw ( J(w(k))) (8.55) 

where, q is the step-size parameter and is the gradient of the per- 
formance surface. 

The gradient of the performance surface is given in Eq. (8.51). If we 
substitute the instantaneous correlation approximations, we have the 
LMS solution. 

w(k + 1) = w(k) — p[R xx w — r] 

= w(k ) + pe*(k)x(k) (8.56) 



where 



e(k ) = d(k) — w H {k)x{k) — error signal 

The convergence of the LMS algorithm in Eq. (8.56) is directly pro- 
portional to the step-size parameter q. If the step-size is too small, the 
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convergence is slow and we will have the overdamped case. If the con- 
vergence is slower than the changing angles of arrival, it is possible that 
the adaptive array cannot acquire the signal of interest fast enough to 
track the changing signal. If the step-size is too large, the LMS algo- 
rithm will overshoot the optimum weights of interest. This is called the 
underdamped case. If attempted convergence is too fast, the weights 
will oscillate about the optimum weights but will not accurately track 
the solution desired. It is therefore imperative to choose a step-size in a 
range that insures convergence. It can be shown that stability is insured 
provided that the following condition is met [30] . 

0 < p < — — (8.57) 

where X max is the largest eigenvalue of R xx . 

Since the correlation matrix is positive definite, all eigenvalues are 
positive. If all the interfering signals are noise and there is only one 
signal of interest, we can approximate the condition in Eq. (8.57) as 

0 < p < ^ — (8.58) 

2 trace [R xx ] 

Example 8.6 An M = 8-element array with spacing d = ,5a has a received 
signal arriving at the angle 9q = 30°, an interferer at 9\ — —60°. Use 
MATLAB to write an LMS routine to solve for the desired weights. Assume 
that the desired received signal vector is defined by x s (k) = dgsik), where 
s(k) = cos(2*pi*t(k)/T); with T = 1 ms and t = (1:100)*T/100. Assume 
the interfering signal vector is defined by xi(k) = a\i(k), where i(k) = 
randn(l,100);. Both signals are nearly orthogonal over the time interval T. 
Let the desired signal d(k ) = s(k). 

Use the least mean square algorithm given in Eq. (8.56) to solve for the 
optimum array weights. Assume that the initial array weights are all zero. 
Allow for 100 iterations. Using MATLAB: 

(а) Let step size fi = .02. 

(б) Calculate the eight array weights for 100 iterations. 

(c) Plot the resulting weights magnitude vs. iteration number. 

(d) Plot the desired signal s(k) and the array output y{k). 

(e) Plot the mean square error |e| 2 . 

( f) Plot the array factor using the final weights calculated. 

Solution The MATLAB code sa ex8 6.m is used to solve the problem. The 
magnitude of the weights vs. iteration number is shown in Fig. 8.13. 
Figure 8.14 shows how the array output acquires and tracks the desired sig- 
nal after about 60 iterations. Figure 8.15 shows the resulting mean square 
error which converges to near zero after 60 iterations. Figure 8.16 shows the 
final weighted array which has a peak at the desired direction of 30° and a 
null at the interfering direction of —60°. 




More Books Visit : www.iqbalkalmati.blogspot.com 



230 Chapter Eight 




8.4.2 Sample matrix inversion 

One of the drawbacks of the LMS adaptive scheme is that the algo- 
rithm must go through many iterations before satisfactory convergence 
is achieved. If the signal characteristics are rapidly changing, the LMS 




Figure 8.14 Acquisition and tracking of desired signal. 
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adaptive algorithm may not allow tracking of the desired signal in a sat- 
isfactory manner. The rate of convergence of the weights is dictated by 
the eigenvalue spread of the array correlation matrix. In Example 8.6, 
the LMS algorithm did not converge until after 70 iterations. 70 iter- 
ations corresponded to more than half of the period of the waveform 




Figure 8.16 Weighted LMS array. 
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of interest. One possible approach to circumventing the relatively slow 
convergence of the LMS scheme is by use of SMI method [7, 23, 36]. 
This method is also alternatively known as direct matrix inversion 
(DMI) [30]. The sample matrix is a time average estimate of the ar- 
ray correlation, matrix using IT-time samples. If the random process 
is ergodic in the correlation, the time average estimate will equal the 
actual correlation matrix. 

Recalling the earlier discussion (Sec. 8.3.2) of the minimum MSE, 
the optimum array weights are given by the optimum Wiener solution 
as 



= Rjr 

where 



(8.59) 



r — E[d* ■ x] 
R xx = E \xx H ] 



As was shown with Eq. (7.32), we can estimate the correlation matrix 
by calculating the time average such that 

1 K 

R xx = — S'' x(k)x H (k) (8.60) 

xv z J 

k=l 

where K is the observation interval. 

The correlation vector f can be estimated by 

1 K 

f=-y d*{k)x{k) (8.61) 

k=l 



Since we use a IT-length block of data, this method is called a block- 
adaptive approach. We are thus adapting the weights block-by-block. 

It is easy in MATLAB to calculate the array correlation matrix and 
the correlation vector by the following procedure. Define the matrix 
Xk (k)as the Mh block of x vectors ranging over IT-data snapshots. Thus 



X K (k) = 



xi(l + kK) 
X2(l + kK) 



xi(2 + kK) 
X2(2 + kK) 



xi (K + kK) 



(8.62) 



xm( 1 + kK) ■ ■ ■ xm( K + kK) 



where k is the block number and K is the block length. 
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Thus, the estimate of the array correlation matrix is given by 

R xx (k) = ]X K (k)X*(k) (8.63) 

In addition, the desired signal vector can be define by 

d(k) = [d(l + kK) d( 2 + kK) • • • d(K + kK)] (8.64) 

Thus the estimate of the correlation vector is given by 

f{k) = ±-d*(k)X K {k) (8.65) 

The SMI weights can then be calculated for the Mh block of length K 
as 



wsMi(k) — R xx (k)r(k) 

= [XKiWXgik)]- 1 d*{k)X K {k) (8.66) 

Example 8.7 Let us compare the SMI solution to the LMS solution of 
Example 8.5. An M = 8-element array with spacing d = .5/. has a received 
signal arriving at the angle 6q — 30°, an interferer at 0\ = —60°. Use 
MATLAB to write an SMI routine to solve for the desired weights. Assume 
that the desired received signal vector is defined by x s (k) — dos(k) where 
s(k) — cos(2*pi*t(k)/T) with T — 1 ms. Let the block length be K — 30. Time is 
defined as t = (1:K)*T/K. Assume the interfering signal vector is defined by 
Xi(k) — d\i{k ) where i(k) = randnfb.K'). Let the desired signal d(k) = s(k). 
In order to keep the correlation matrix inverse from becoming singular, add 
noise to the system with variance cr% = .01. 

Solution The MATLAB code sa ex8 7.m is used to solve the problem. We may 
calculate the array input noise by use of the MATLAB command 
n = randn(N,K)*sqrt(sig2); where sig2 = noise variance. After specifying 
the number of elements, the received signal angle, and the interfering angle, 
we calculate the received vector x(k). The correlation matrix is found using 
the simple MATLAB command Rxx = X*X’/K, the correlation vector is found 
using r = X*S’/K. The optimum Wiener weights are found and the resulting 
array pattern is seen in Fig. 8.17. The SMI pattern is similar to the LMS pat- 
tern and was generated with no iterations. The total number of snapshots K 
is less than the time to convergence for the LMS algorithm. 

The SMI algorithm, although faster than the LMS algorithm, has several 
drawbacks. The correlation matrix may be ill conditioned resulting in errors 
or singularities when inverted. In addition, for large arrays, there is the chal- 
lenge of inverting large matrices. To invert, the correlation matrix requires 
N 3 / 2 + N 2 complex multiplications [20]. The SMI update frequency will 
necessarily depend on signal frequencies and on channel fading conditions. 
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8.4.3 Recursive least squares 

As was mentioned in the previous section, the SMI technique has sev- 
eral drawbacks. Even though the SMI method is faster than the LMS 
algorithm, the computational burden and potential singularities can 
cause problems. However, we can recursively calculate the required 
correlation matrix and the required correlation vector. Recall that in 
Eqs. (8.60) and (8.61) the estimate of the correlation matrix and vector 
was taken as the sum of the terms divided by the block length K. When 
we calculate the weights in Eq. (8.66), the division by K is cancelled 
by the product R~ x (k)r(k). Thus, we can rewrite the correlation matrix 
and the correlation vector omitting K as 

k 

R xx (k) = ^x(i)x H (i) (8.67) 

1=1 
k 

f(k) = Y J d%i)x(i) (8.68) 

i = 1 

where k is the block length and last time sample k and R xx (k),r(k) is 
the correlation estimates ending at time sample k. 

Both summations (Eqs. (8.67) and (8.68)) use rectangular windows, 
thus they equally consider all previous time samples. Since the signal 
sources can change or slowly move with time, we might want to deem- 
phasize the earliest data samples and emphasize the most recent ones. 
This can be accomplished by modifying Eqs. (8.67) and (8.68) such that 
we forget the earliest time samples. This is called a weighted estimate. 
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Thus 



k 



R xx (k ) = ^2°t k l x(i)x H (i) 
1=1 


(8.69) 


k 

r{k ) = a k ~ l d*{i)x{i) 

1 = 1 


(8.70) 



where a is the forgetting factor. 

The forgetting factor is also sometimes referred to as the exponential 
weighting factor [37]. a is a positive constant such that 0 < a < 1. 
When a — 1, we restore the ordinary least squares algorithm, a — 1 
also indicates infinite memory. Let us break up the summation in Eqs. 
(8.69) and (8.70) into two terms: the summation for values up to i —k-1 
and last term for i = k. 

k-i 

R xx {k) = a a k ~ 1 ~ l x{i)x H {i) + x(k)x H (k) 

1 = 1 

= aR xx (k — 1) + x(k)x H (k) (8.71) 

k-i 

r{k) = a ^ a k ^ 1 ^ l d*(i)x(i) + d*(k)x(k) 

1 = 1 

= ar(k — 1) + d*(k)x(k) (8.72) 

Thus, future values for the array correlation estimate and the vector 
correlation estimate can be found using previous values. 

Example 8.8 For an M = 4-element array, d = /./2, one signal arrives at 
45°, and S(k) = cos(27r(£ — 1)/(K — 1)). Calculate the array correlation for a 
block of length K = 200 using the standard SMI algorithm and the recursion 
algorithm with a — 1. Plot the trace of the SMI correlation matrix for K data 
points and the trace of the recursion correlation matrix vs. block length k 
where 1 < k < K . 

Solution Using MATLAB code sa ex8 8.m we can construct the array steering 
vector for the angle of arrival of 45° . After multiplying the steering vector by 
the signal S(k) we can then find the correlation matrix to start the recursion 
relationship in Eq. (8.71). After K iterations, we can superimpose the traces 
of both correlation matrices. This is shown in Fig. 8.18. 

It can be seen that the recursion formula oscillates for different block 
lengths and that it matches the SMI solution when k — K. The recursion 
formula always gives a correlation matrix estimate for any block length k 
but only matches SMI when the forgetting factor is 1. The advantage of the 
recursion approach is that one need not calculate the correlation for an entire 
block of length K. Rather, each update only requires one a block of length 1 
and the previous correlation matrix. 
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Not only can we recursively calculate the most recent correlation 
estimates, we can also use Eq. (8.71) to derive a recursion relation- 
ship for the inverse of the correlation matrix. The next steps follow 
the derivation in [37]. We can invoke the Sherman Morrison -Woodbury 
(SMW) theorem [38] to find the inverse of Eq. (8.71). Repeating the 
SMW theorem 

IA + zz H r' = A-' - (8.73) 

1 + z H A- x z 



Applying Eq. (8.73) to Eq. (8.71), we have the following recursion 
formula: 

47»i - - 1) - ( * - ~ » (a74 , 

1 + a~ 1 x H (k)R~J-(k — l)x(k) 

We can simplify Eq. (8.74) by defining the gain vector g(k) 

a^R-Mk — l)x(k) 



g(k) = 



1 + a~ 1 x H (k)R~ 1 (k — l)x(&) 



Thus 



R-^ik) = a-'R-^ik - 1) - a~ 1 g(k)x H (k)R~J'(k - 1) 



(8.75) 



(8.76) 



Equation (8.76) is known as the Riccati equation for the recursive least 
squares (RLS) method. We can rearrange Eq. (8.75) by multiplying the 
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denominator times both sides of the equation to yield 

g(k) = [a-'R^ik - 1) - <x~ l g{k)x H {k)R^{k - \)}x{k) (8.77) 

It is clear that the term inside of the brackets of Eq. (8.77) is equal to 
Eq. (8.76). Thus 

gik) = R- x Hk)x(k) (8.78) 

Now we can derive a recursion relationship to update the weight vec- 
tors. The optimum Wiener solution is repeated in terms of the iteration 
number k and we can substitute Eq. (8.72) yielding 

w{k) — R~ x (k)r(k) 

= aR~ x \k)f(k - 1) + R- X \k)x(k)d*(k) (8.79) 

We may now substitute Eq. (8.76) into the first correlation matrix in- 
verse seen in Eq. (8.79). 

w(k ) — R xx (k — 1 )r(k — 1) — g(k)x H {k)R~ x (k — l)r(k — 1) 

+ R~J-(k)x(k)d*(k) 

— w{k — 1) — g(k)x H (k)w(k — 1) + R~ x (k)x(k)d*(k) (8.80) 

Finally we may substitute Eq. (8.78) into Eq. (8.80) to yield 
w{k) — w(k — 1) — g(k)x H {k)w(k — 1) + g(k)d*(k) 

— w(k — 1) + g(k)[d*{k) — x H {k)w{k — 1)] (8.81) 

It should be noted that Eq. (8.81) is identical in form to Eq. (8.56). 

Example 8.9 Use the RLS method to solve for the array weights and plot the 
resulting pattern. Let the array be an M = 8-element array with spacing 
d — .5 a with a received signal arriving at the angle Qq = 30°, an interferer 
at = —60°. Use MATLAB to write an RLS routine to solve for the desired 
weights. Use Eqs. (8.71), (8.78), and (8.81). Assume that the desired received 
signal vector is defined by x s (k ) = aos(k) where s(k) = cos(2*pi*t(k)/T); with 
T = 1 ms. Let there be K = 50 time samples such that t = (0 : K—1)*T/(K—1). 
Assume that the interfering signal vector is defined by Xi(k) = d\i{k) where 
i(k) = sin(pi*t(k)/T);. Let the desired signal d(k) = s(k). In order to keep the 
correlation matrix inverse from becoming singular, add noise to the system 
with variance o% = .01. Begin with the assumption that all array weights are 
zero such that u>(l) = [0 0 0 0 0 0 0 0] r . Set the forgetting factor 
a = .9. 



Solution MATLAB code sa_ex8_9.m is used to solve for the array weights and 
to plot the resulting pattern shown in Fig. 8.19. 
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The advantage of the RLS algorithm over SMI is that it is no longer nec- 
essary to invert a large correlation matrix. The recursive equations allow for 
easy updates of the inverse of the correlation matrix. The RLS algorithm also 
converges much more quickly than the LMS algorithm. 

8.4.4 Constant modulus 

Many adaptive beamforming algorithms are based on minimizing the 
error between a reference signal and the array output. The reference 
signal is typically a training sequence used to train the adaptive array 
or a desired signal based upon an a priori knowledge of nature of the 
arriving signals. In the case where a reference signal is not available 
one must resort to an assortment of optimization techniques that are 
blind to the exact content of the incoming signals. 

Many wireless communication and radar signals are frequency-or 
phase-modulated signals. Some examples of phase and frequency mod- 
ulated signals are FM, PSK, FSK, QAM, and polyphase. This being the 
case, the amplitude of the signal should ideally be a constant. Thus the 
signal is said to have a constant magnitude or modulus. However, in 
fading channels, where multipath terms exist, the received signal is 
the composite of all multipath terms. Thus, the channel introduces 
an amplitude variation on the signal magnitude. Frequency selective 
channels by definition destroy the constant modulus property of the 
signal. If we know that the arriving signals of interest should have a 
constant modulus, we can devise algorithms that restore or equalize the 
amplitude of the original signal. 
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Dominique Godard [39] was the first to capitalize on the constant 
modulus (CM) property in order to create a family of blind equaliza- 
tion algorithms to be used in two-dimensional data communication 
systems. Specifically, Godard’s algorithm applies to phase modulating 
waveforms. Godard used a cost function called a dispersion function of 
order p and, after minimization, the optimum weights are found. The 
Godard cost function is given by 

J(k) = E[(\y(k)\ p -R p ) q ] (8.82) 

where p is the positive integer and q is the positive integer = 1. 

Godard showed that the gradient of the cost function is zero when R p 
is defined by 



= E[\s{k)\ 2p \ 
p E[\s(km 

where s{k) is the zero-memory estimate of y(k). 

The resulting error signal is given by 

e(k) = y(k)\y(k)\r- 2 (R P - \y(k)\ p ) 



(8.83) 



(8.84) 



This error signal can replace the traditional error signal in the LMS 
algorithm to yield 



w{k + 1) = w{k) + pe*(k)x(k) (8.85) 



The p — 1 case reduces the cost function to the form 

J{k) = E[{\y{k)\-R l ) 2 ) (8.86) 



where 



E[\s(k)\ 2 } 

E[\s(k)\) 



(8.87) 



If we scale the output estimate s{k) to unity, we can write the error 
signal in Eq. (8.84) as 



e(k) — 




Thus the weight vector, in the p — 1 case, becomes 




( 8 . 88 ) 



w{k + 1) = w(k ) + p, l — 



y*(k)x(k ) 



(8.89) 
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The p — 2 case reduces the cost function to the form 

J(k) = E[{\y{k)\ 2 - Rz) 2 } (8.90) 

where 

E[\ s m 4 ] 



r 2 = 



E[\s(k)\ 2 ] 



(8.91) 



If we scale the output estimate s(k) to unity, we can write the error 
signal in Eq. (8.84) as 

elk) = y(k)( 1 - \y(k)\ 2 ) 



Thus the weight vector, in the p — 2 case, becomes 

w{k + 1) = w{k) + /x( 1 — \y{k) \ 2 )y*(k)x(k) 



(8.92) 

(8.93) 



The cases where p — 1 or 2 are referred to as constant modulus algo- 
rithms (CMA). The p — 1 case has been proven to converge much more 
rapidly than the p = 2 case [40]. A similar algorithm was developed by 
Treichler and Agee [41] and is identical to the Godard case for p = 2. 



Example 8.10 Allow the same constant modulus signal to arrive at the re- 
ceiver via a direct path and two additional multipaths and assume that the 
channel is frequency selective. Let the direct path arriving signal be defined 
as a 32 -chip binary sequence where the chip values are ±1 and are sampled 
four times per chip. The direct path signal arrives at 45°. The first multipath 
signal arrives at —30° but is 30 percent of the direct path in amplitude. The 
second multipath signal arrives at 0° but is 10 percent of the direct path 
in amplitude. Because of multipath, there will be slight time delays in the 
binary sequences causing dispersion. This time delay can be implemented 
by zero padding the multipath signals to effect a time delay. We will use the 
P = 1 CMA algorithm to define the optimum weights. Choose fi = .5, N = 8 
elements, and d = >72 . Define the initial weights w( 1) to be zero. Plot the 
resulting pattern. 

Solution The three received waveforms are shown in Fig. 8.20. The last block 
indicates the combined received waveforms. It can be seen that the combined 
received signal has an amplitude variation due to channel dispersion. 

The array output is defined as y(k) = w H (k)x(k) . The recursion relationship 
for the array weights is given in Eq. (8.89). MATLAB code sa_ex8_10.m is used 
to solve for the weights. The resulting pattern plot is given in Fig. 8.21. It 
should be noted that the CMA algorithm suppresses the multipath but does 
not cancel it. 



8.4.5 Least squares constant modulus 

One severe disadvantage of the Godard CMA algorithm is the slow 
convergence time. The slow convergence limits the usefulness of the 
algorithm in dynamic environments where the signal must be captured 
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Figure 8.20 (a) Direct path, (6) path 2, (c) path 3, and ( d ) combined signals. 




Figure 8.21 CMA pattern. 
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quickly. This also limits the usefulness of CMA when channel conditions 
are rapidly changing. The previous CMA method is based upon the 
method of steepest descent by taking the gradient of the cost function 
in Eq. (8.82). A faster algorithm was developed by Agee [42] using the 
method of nonlinear least-squares. The least-squares method is also 
known as the Gauss method based upon the work of Gauss in 1795 
[43]. This method is referred to as the LS-CMA algorithm [44] and is 
also known as an autoregressive estimator based on a least squares 
minimization [45]. 

The following derivation is taken directly from [42] and [44] . In the 
method of least squares, one defines a cost function which is the weighted 
sum of error squares or the total error energy. The energies are the 
energies of a finite sample set K. The cost function is defined by 

K 

C(w) = ^2 \<Pk(w)\ 2 =|| H 2 (8.94) 

k=i 

where </>&(u>) = error at Mh data sample 

4>( ic ) = [0 i («5) ■■■ c/>k(w)] t 

K — number of data samples in one block 

Equation (8.94) has a partial Taylor-series expansion with a sum-of- 
squares form given as 

C(w + A) ss ||<t>(u}) +J h (w)A \\ 2 2 (8.95) 

Where the complex Jacobian of <t>(u>) is defined as 

J{w) = [V</>i(iD) V</>2 (w) ■■■ V(/> K (w)] (8.96) 

and A is the offset that updates weights. 

We wish to find the offset A which minimizes the sum-of-squared 
errors. Taking the gradient of Eq. (8.95) and setting it equal to zero, we 
can find the optimum offset vector to be defined as 

A = - [T(w)T H (w)\ (8.97) 

The new updated weight vector is then given by 

w(n+ 1) = win) — [J(iv(n))J H (w(n))] 1 J(w(n)) < t>(w(n)) (8.98) 

The new weight vector is the previous weight vector adjusted by the 
offset A. The number 11 is the iteration number not to be confused with 
the time sample k. 




More Books Visit : www.iqbalkalmati.blogspot.com 



Smart Antennas 243 



|_l:v(*0l-ij 

We may now define the Jacobian of the error vector (p ( w) 
J(w) = [Vitpiiw) V(0 2 (d)) ••• V(cp K (w)] 



where 



y*(l) y*(2) 

= rw " (2) w2h 

= XY cm 

X= [x(l) x{2) •• 



y*(K) 

\y(K)\ 



x(K)] 



Let us now apply the least squares method to the CMA using the 1-2 
cost function [8] . 

K K 

C(w ) = ^2 \<Pk(u >)\ 2 =Y \\y(k)\ - 1| 2 (8.99) 

k=i k=i 

where y(k) = w H x(k) — array output at time k. 

We may write <pk as a vector such that 

'|y(l)| - r 
IM2)| - 1 

(pm = . (8.100) 



( 8 . 100 ) 



( 8 . 101 ) 



(8.102) 



~ y*h) q 

|yd)| u 



Yc.m = 



(8.103) 



0 0 

L u u LyOOlJ 

Multiplying the Jacobian times its Hermitian transpose, we get 
J{w)J H {w) = XY cm Yq M X h = XX H 0 



(8.104) 



The product of the Jacobian times the energy matrix is given by 





'IMDI-1' 






|y(2)| — 1 




= XY C m 




= X 




_\y(K)\ - 1 





*d) _ y*(D 

y '•D |y(l)| 

v*(2) — 

y Z |y(2)| 



V *(K) - yHK) 



= X{y — r)* 



(8.105) 
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where 



y — [y(l) y(2) ■■■ y(K)] T (8.106) 

and 



~ yd) 
JMDI 



yi 2 ) 
|y(2)| 



y(tf) 

Wl 



= 



(8.107) 



where L(y)is a hard-limiter acting on y. 

Substituting Eqs. (8.103) and (8.105) into Eq. (8.98), we get 



w(R+D=«)(n)-[lI H ] 1 X iyin) — rin))* 

= win) - [XX H ] ~ 1 XX H w(n) + {XX^Xf'in) 

= [XX H ] _1 Xr*(7z) (8.108) 



where 



r * in) 



w H in)x (1) 


w H in)xi2) 


w H in)xiK) 


w H in)xi 1) 


w H i n)x(2) 


w H in)xiK)\ 



(8.109) 



While only one block of data is used to implement the LS-CMA, the 
algorithm iterates through n values until convergence. Initial weights 
wil) are chosen, the complex-limited output data vector r*(l) is cal- 
culated, then the next weight u>i2) is calculated, and the iteration 
continues until satisfactory convergence is satisfied. This is called the 
static LS-CMA algorithm because only one static block, of length K, is 
used for the iteration process. The LS-CMA algorithm bears a striking 
resemblance to the SMI algorithm in Eq. (8.66). 



Example 8.11 Allow the same constant modulus signal to arrive at the 
receiver via a direct path and one additional multipath and assume that the 
channel is frequency selective. Let the direct path arriving signal be defined 
as a 32 -bit binary chipping sequence where the chip values are ±1 and are 
sampled four times per chip. Let the block length K = 132. The direct path 
signal arrives at 45°. A multipath signal arrives at —30° but is 30 percent 
of the direct path in amplitude. Because of multipath, there will be slight 
time delays in the binary sequences causing dispersion. Let the dispersion 
be implemented by zero padding the two signals. Zero pad the direct path 
by adding four zeros to the back of the signal. Zero pad the multipath signal 
with two zeros in front and two zeros in back. Use the LS-CMA algorithm 
to find the optimum weights. N = 8 elements and cL = X/2. Add zero mean 
Gaussian noise for each element in the array and let the noise variance be 
o% = .01. Define the initial weights wil) to be one. Iterate three times. Plot 
the resulting pattern. 
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Solution The MATLAB code sa ex8 ll.m can be utilized for this purpose. 

The CM waveforms are generated identically to Example 8.10. The resulting 

pattern is shown in Fig. 8.22. 

It should be noted that LS-CMA algorithm does a better job of nulling 
the multipath terms than does the CMA algorithm from the previous 
example. 

The chief advantage of the static LS-CMA is that it can converge up to 
100 times faster than the conventional CMA algorithm. In fact, in this 
example, the weights effectively converged after only a few iterations. 

The static LS-CMA algorithm computed the weights simply based 
upon a fixed block of sampled data. In order to maintain up-to-date 
adaptation in a dynamic signal environment, it is better to update the 
data blocks for each iteration. Thus a dynamic LS-CMA algorithm is 
more appropriate. The dynamic LS-CMA is a modification of the previ- 
ous static version. Let us define a dynamic block of data as the array 
output before applying weights. For the nth iteration, the ?ith block of 
length K is given as 

X{ri) = [x(l + nK) x(2 + nK) ••• x(K + nK)] (8.110) 

The weighted array output, for the nth iteration is now defined as 

y(n) = \y(l + nK) y(2 + nK) ■■■ y(K + nK)] T = [w H (n)X(n)] T 

( 8 . 111 ) 



More Books Visit : www.iqbalkalmati.blogspot.com 



246 Chapter Eight 



The complex limited output data vector is given as 



r(n) — 



y{ 1 + nK) 
\y( 1 + nK) | 



y(2 + nK) 
\y{2 + nK)\ 



y(K + nK) 
\y(K + nK)\ 



lT 



Replacing Eq. (8.108) with the dynamic version we have 



w(n+ 1) = [X{?i)X H (n)] 1 X{n)r*(n) 



( 8 . 112 ) 



(8.113) 



where 

. [ w H (n)x(l + tiK) w H {n)x{2 + nK) 

r (n) = 

[ | w H ( n)x( 1 + nK) | | w H ( n)x( 2 4- nK) \ 



i H 



w H (n)x(K + jiK) 

| w H {n)x(K + nK) \ J 
(8.114) 



We can further simplify Eq. (8.113) by defining the array correlation 
matrix and the correlation vector as 



and 



K xx {n) 



X(n)X H {n) 

K 



„ . . X{n)r*{n) 

Pxr(n) = — 



The dynamic LS-CMA is not defined as 



(8.115) 



(8.116) 



w(n+ 1) = R x J-(n)p xr (n) 



(8.117) 



Example 8.12 Repeat Example 8.11 using the dynamic LS-CMA. Define the 
block length K = 22 data points. Allow the block to update for every 
iteration n. N — 8-elements and d — a/2. Define the initial weights uj( 1 ) 
to be one. Solve for the weights after 6 iterations. Plot the resulting pattern. 

Solution The MATLAB code sa_ex8_12.m is used for this purpose. The CM 
waveforms are generated identically to Example 8.10 except that the block 
of fT-data points moves with the iteration number. The resulting pattern is 
shown in Fig. 8.23. 



8.4.6 Conjugate gradient method 

The problem with the steepest descent method has been the sensitiv- 
ity of the convergence rates to the eigenvalue spread of the correlation 
matrix. Greater spreads result in slower convergences. The convergence 
rate can be accelerated by use of the conjugate gradient method (CGM). 
The goal of CGM is to iteratively search for the optimum solution by 
choosing conjugate (perpendicular) paths for each new iteration. Con- 
jugacy in this context is intended to mean orthogonal. The method of 
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CGM produces orthogonal search directions resulting in the fastest con- 
vergence. Figure 8.24 depicts a top view of a two-dimensional perfor- 
mance surface where the conjugate steps show convergence toward the 
optimum solution. Note that the path taken at iteration n + 1 is per- 
pendicular to the path taken at the previous iteration n. 

The CGM method has its early roots in the desire to solve a system 
of linear equations. One of the earliest references to CGM is found in a 
1952 document by Hestenes and Stiefel [46]. Additionally, early work 




Figure 8.24 Convergence using 
conjugate directions. 



More Books Visit : www.iqbalkalmati.blogspot.com 



248 Chapter Eight 

was performed by Fletcher and Powell in 1963 [47] and Fletcher and 
Reeves in 1964 [48]. The CGM has also has been called the accelerated 
gradient approach (AG) by Monzingo and Miller [30] . The gradient is 
accelerated by virtue of choosing conjugate directions. The CGM method 
was modified for use in predicting array weights by Choi [49, 50]. A 
survey of this method has been written by Godara [51] and a concise 
summary of the method can be found in Sadiku [52]. The following 
summary is taken from [51] and [52]. 

CGM is an iterative method whose goal is to minimize the quadratic 
cost function 

Jiw) — -w H Aw — d H w (8.118) 

£ 

where 

Xl(l) X 2 (l) ■" Xm( 1) 

Xi(2) x 2 (2) ••• xm( 2) 

A = . K x M matrix of array snapshots 

xi iK) x 2 iK) ■■■ xm(K) 

K = number of snapshots 

M — number of array elements 

w = unknown weight vector 

d — [d(l) d(2) • • • d( K )] t = desired signal vector of K snapshots 

We may take the gradient of the cost function and set it to zero in 
order to find the minimum. It can be shown that 

V a J(w) = Aw — d (8.119) 

We may employ the method of steepest descent in order to iterate to 
minimize Eq. (8.119). We wish to slide to the bottom of the quadratic 
cost function choosing the least number of iterations. We may start with 
an initial guess for the weights wi 1) and find the residual r(l). The first 
residual value after at the first guess is given as 

r(l) = — = d — Aw{\) (8.120) 

We can next choose a direction vector D which gives us the new con- 
jugate direction to iterate toward the optimum weight. Thus 

D(l)=A ff r(l) (8.121) 

The general weight update equation is given by 

win + 1) = win) — p{n)D(n) (8.122) 
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Where the step size is determined by 



r H in)AA H rin) 
D H i n) A H ADi n) 



(8.123) 



We may now update the residual and the direction vector. We can pre- 
multiply Eq. (8.122) by —A and add d to derive the updates for the 
residuals. 

fin + 1) = fin) + qin)AD(n) (8.124) 

The direction vector update is given by 

Din+ 1) = A H fin+ 1) — oein)Din) (8.125) 



We can use a linear search to determine a(n) which minimizes Jiwin)). 
Thus 



airi) — 



r H in+ 1 )AA H rin+ 1) 
f H in)AA H fin) 



(8.126) 



Thus, the procedure to use CGM is to find the residual and the cor- 
responding weights and update until convergence is satisfied. It can be 
shown that the true solution can be found in no more than K iterations. 
This condition is known as quadratic convergence. 



Example 8.13 For the M = 8-element array with elements a half-wavelength 
apart, find the optimum weights under the following conditions: The arriving 
signal of interest is s =cos(7r k/K); arriving at an angle of 45° . One interfering 
signal is defined as I\ = randn(l,K); arriving at —30°. The other interfering 
signal is defined as 1 2 = randn(l,K); arriving at 0°. The noise has a variance 
of ct 2 = .001. Thus, n — cr*randn(l,K);. Use the CGM method to find the 
optimum weights when using a block size K — 20. Plot the norm of the 
residual for all iterations. Plot the resulting pattern. 

Solution The MATLAB code sa_ex8_13.m is used to solve for the optimum 
weights using CGM. The plot of the norm of the residuals is given in Fig. 8.25. 
It can be seen that the residual drops to very small levels after 14 iterations. 
The plot of the resulting pattern is shown in Fig. 8.26. It can be seen that 
two nulls are placed at the two angles of arrival of the interference. 

It should be especially noted that MATLAB provides a conjugate gra- 
dient function that can solve for the optimum weights. The function is 
the least squares implementation of CG and is given by w — lsqr(A,d); 
A is the matrix as defined earlier and d is the desired signal vector 
containing ff-data samples. Using the parameters in Example 8.12, we 
can use the MATLAB function lsqr(A,d) and produce the array factor 
plot in Fig. 8.27. The MATLAB function produces nulls in the correct 
locations but the sidelobe levels are higher than produced by the code 
sa_ex8_12.m. 
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8.4.7 Spreading Sequence Array Weights 

A radically different approach to wireless beamforming has been pro- 
posed which can augment or replace traditional approaches to DBF 
or adaptive arrays. This new approach does not perform electronic or 
digital phase shifting. It does not rely on adaptive methods; however, 




Figure 8.26 Array pattern using CGM. 
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Figure 8.27 Array pattern resulting from use of the MATLAB CGM. 



receive beams are produced in directions of interest. An array corre- 
lation matrix is not computed, array weights are not adapted, an a 
priori knowledge of the signal environment is not required, and a de- 
sired or pilot signal is not used. Rather, a clever spreading technique is 
used that uniquely defines all received directions of interest. This algo- 
rithm can create virtual contiguous beams like a Butler matrix, or it can 
selectively look in any specific directions of interest. 

This new DBF receiver is based upon a radical shift in traditional 
DBF methods. This out of the box approach blends the virtues of switched 
beam and adaptive array technologies while avoiding many of their re- 
spective weaknesses. This solution does not direct beams by any pre- 
viously known method, although this approach falls under the general 
topic of waveform diversity. This new method provides the same spa- 
tial resolution as traditional arrays while simultaneously looking in all 
desired directions of interest. 

The previous adaptive methods discussed rely on subspace methods, 
steepest descent methods, gradients, blind adaptive algorithms, signal 
coherence, constant moduli, and other known signal properties in or- 
der to provide feedback to control the array weights. There are many 
drawbacks to the previous methods including computational intensity, 
a required knowledge of signal statistics, signal independence, and slow 
convergence rates. 

This wholly new approach is based upon a recent patent by Carl 
Elam [53]. In this new method, adaptation is not necessary and 
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tracking is not performed, while excellent results are achieved. This 
approach can use any antenna configuration and, within the resolu- 
tion of the antenna geometry, can simultaneously receive numerous 
signals of unequal strength at various angles. This novel approach can 
use any arbitrary two-dimensional array to produce an instantaneous 
pin-cushion array. 

The new technique essentially works by designing the array weights 
to be time-varying random phase functions. The weights act to modulate 
each array element output. Specifically, the array outputs are weighted 
or modulated with a set of statistically independent polyphase chipping 
sequences. A different and independent modulating waveform is used 
for each antenna output. The chips are the same as chips defined in the 
traditional communications sense. The phase modulation waveforms 
purposely shred the phase of each antenna output by chipping at rates 
much higher than the baseband frequency of the message signal. This 
shredding process breaks up the phase relationship between all array 
elements and thus purposely eliminates array phase coherence. This is 
opposite of the traditional goal to achieve phase coherence for a specific 
look angle. The receiver then can see all incoming signals simultane- 
ously without the necessity of steering or adapting because the array 
elements, for the moment, become statistically independent from one 
another. The chipped incoming waveforms are processed in a quadra- 
ture receiver and are subsequently compared to similar chipped wave- 
forms stored in memory. The memory waveforms are created based upon 
expected angles of arrival. The theoretical spatial capacity is simply the 
angular space of interest divided by the antenna array beam solid angle. 



8.4.8 Description of the new SDMA receiver 

The novel DBF receiver is depicted in Fig. 8.28. The novelty of this new 
approach is vested in the nature of the signals /}„(£), the unique array 
signal memory, and correlation based detection. 

The new SDMA digital beamformer can be used with any arbitrary 
Af-element antenna array. It can be a linear array but should preferably 
be a two-dimensional or three-dimensional random array such that the 
antenna geometry and element phasing is unique for each incoming 
angle-of-arrival. Ideally the receive array should have an equal angular 
resolution over all receive angles of interest. For purposes of illustra- 
tion, the array used in this discussion will be an Ai-element linear array. 

The incoming signals arrive at angles Of where l = 1, 2 ... L. Each 
different angle of arrival produces a unique array element output 
with a unique phase relationship between each element. These phase 
relationships will be used in conjunction with the modulations fi n (t) to 
produce a unique summed signal y r (t). 
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0i 





0L 




Figure 8.28 SDMA quadrature receiver. 



Corresponding to the actual JV-element antenna array is a second 
virtual array modeled in memory. The virtual array is modeled after the 
actual physical array used. The memory array has K possible virtual 
outputs for each expected direction Ok(k — 1,2 , ,K). The total number 
of expected directions K should be less than or equal to the number of 
antenna elements N. These memory signals are generated based upon 
a knowledge of the antenna array geometry and the calculated phase 
delays for each specific direction of interest. The expected directions are 
generally chosen by the user to be no closer than the angular resolution 
allowed by the array. Consideration can be given to the local topography 
and potential multipath directions in eliminating possible unwanted 
expected directions. All potential incoming directions may not be used 
if it is desired to block interfering signals from known directions. 

The A-element array outputs and the A-antenna array memory out- 
puts are both phase modulated (PM) by the same set of A pseudo-noise 
( pn) phase modulating sequences. The nth phase modulating sequence 
will be designated as p n (t). p n (t) is composed of M polyphase chips. Each 
chip is of length r c and the entire sequence is of length T — Mr c . The 
chip rate is chosen to be much greater than the Nyquist rate of the 
incoming baseband signal modulation. The purpose for this over sam- 
pling is so that the phase modulation of the incoming signal is nearly 
constant over the entire group of M chips. In general, the goal should 
be such that T < 1/ (4 B m ) where B m is the message signal bandwidth. 

Each phase modulating waveform f3 n {t) is used to modulate or tag 
each array output with a unique marking or identifying waveform. p n (t) 
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deliberately shreds or scrambles the phase of the signal at array ele- 
ment n. This shredding process, temporarily scrambles the phase rela- 
tionship between all other array elements. The desired element phasing 
is restored in the correlator if the incoming signal correlates with one 
of the memory signals. 

Examples of some usable pn modulating sequences are Gold, Kasami, 
Welti, Golay, or any other sequences with similar statistical and or- 
thogonal properties [54], [56]. These properties will aid in the iden- 
tification of the exact directions of arrival of the L incoming signals. 
Figure 8.29 shows a typical example of the first two of the set of N 
bi-phase sequences applied to an IV-element array. The total code length 
TB m = .25. 

Since the phase out of each antenna element is intentionally scram- 
bled by the modulating sequences, the array pattern is randomized 
during each chip in the sequence. The momentarily scrambled pattern 
changes every r e seconds. As an example, the array patterns are plotted 
for the first four chips of an N = 10-element array as shown in Fig. 8.30. 
Since the phase relationship between all elements is scrambled, the 
array patterns for each new set of chips is random. In the limit, as the 
number of chips increases, the average array pattern, over all chips, 
becomes uniform. 

Each baseband output of the receive array will have a complex volt- 
age waveform whose phase will consist of each emitter’s message signal 
mdt) and the unique receive antenna element phase contributions. 





Figure 8.29 Two sample pn bi-phase sequences. 
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Figure 8.30 Scrambled patterns for the first four chips. 



Ignoring the space loss and polarization mismatches, the received base- 
band array output is given in vector form as 



1 • • • 1 




r ejmMl 


gjkd sin(0i) # # # gjkdsin(OL) 












gj(n—l)kdsin(Oi) # # # ^j(n—l)kdsm(6L) 




C- 



= A r -s r (t ) (8.127) 



where m e (t) 
d 
k 



A r 

s r 



fth emitter’s phase modulation 
array element spacing 
wavenumber 

angle of arrival of the fth incoming signal 

e jkdsin(e t ) e j(n-l)kdsin(0i)]T 

steering vector for direction Of 

matrix of steering vectors for all angles of arrival Of 

vector of arriving signal baseband phasors 



The received signals, for each array output, are phase modulated with 
the chipping sequences as described earlier. The chipping waveforms 
can be viewed as phase-only array weights. These array weights can be 
depicted as the vector The total weighted or chipped array output 
is called the received signal vector and is given by 

y r (t) — fKt) T ■ X r (t ) 



(8.128) 
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In a similar way, the array signal memory steering vectors are created 
based upon M expected angles-of-arrival 9 m . 



A e = 



1 

gjkd sin(#i) 
r,j (n—l)kd sin(#i ) 



1 

e jkd sin(%) 
0 j(n—l)kd sin(fe) 



L®i 



a K\ 



(8.129) 



where a| is the steering vector for expected direction 9k and A e is the 
matrix of steering vectors for expected direction 9k. 

The memory has K outputs, one for each expected direction 9k. Each 
memory output, for the expected angle Ok is given by 



y\{t) = p(t) T ■ a| 



(8.130) 



The signal correlator is designed to correlate the actual received sig- 
nal with the conjugate of the various expected direction memory signals. 
This is similar to matched filter detection. The best correlations occur 
when the actual AOA matches the expected AOA. The correlation can 
be used as a discriminant for detection. Since the arriving signals have 
a random arrival phase delay, a quadrature correlation receiver should 
be employed such that the random carrier phase does not affect the 
detection (Haykin [54]). The general complex correlation output, for 
the Mh expected direction, is given as 

rt+T 

Rk = J y r (t) ■ f k *it)dt = \R k \e m (8.131) 

where Rk is the correlation magnitude at expected angle 9k and <j>k is 
the correlation phase at expected angle Ok. 

The new SDMA receiver does not process the incoming signals with 
phase shifters or beam steering. We do not look for the emitter direction 
by steering but actually find the direction by correlation. The correla- 
tion magnitude \Rk\ is used as the discriminant to determine if a signal 
is present at the expected angle Ok. If the discriminant exceeds a prede- 
termined threshold, a signal is deemed present and the phase is calcu- 
lated. Since it is assumed that the emitter PM is nearly constant over 
the code length Mr c , the correlator output phase angle is approximately 
the average of the emitter PM. Thus 



<p k = arg (R k ) ss fh k 



(8.132) 



where 

i r t+T 

mk ~Tj t 



mk(t)dt = average of emitter’s modulation at angle Ok 
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The average phase ihk retrieved for each quarter cycle of the mit) can 
be used to reconstruct the user’s phase modulation using an FIR filter. 

Example using bi-phase chipping. We can let all chipping waveforms, 
P„(t), be defined as bi-phase pn sequences. It is instructive to show the 
appearance of the signals at various stages in the receiver. 
Figure 8.31 shows the ztth element received baseband modulation, the 
nth phase modulating waveform ( J > n (t), the nth phase modulated out- 
put y^it), and finally the N combined phase modulated outputs y'(t). 




i i i i 

0 0.05 0.1 0.15 0.2 0.25 





Figure 8.31 Display of waveforms throughout the receiver. 
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The fourth waveform is obviously unique owing to the arrival phase 
angle of the incoming signal from direction 9 e and the chipping by the 
N independent phase modulating waveforms. This specific chipped and 
summed waveform is unique to the direction of arrival and can be cor- 
related with pre-calculated waveforms stored in memory. 

Figure 8.32 compares the correlation magnitude in Eq. (8.131) for 
an incoming signal ( 9 — 0°) superimposed with the array factor for a 
typical IV-element linear array. The following receiver values are used: 
N = 21, d — a/2, M = 128 chips. Figure 8.32 demonstrates that the new 
SDMA receiver achieves the same angular resolution as the conven- 
tional linear array provided that the number of chips M is adequately 
large. Shorter sequences can be used when the sequences are not only 
independent but also orthogonal. 

When multiple signals arrive at the receive antenna array, they can 
be detected through the same process of correlating all incoming signals 
with the expected signals stored in memory. Figure 8.33 demonstrates 
the correlation for five arriving signals at equally spaced angles of —60°, 
—30°, 0°, 30°, 60°. The chipping sequences are binary Welti codes of 
length 32. 

After a signal is deemed present near the expected angle Ok, we 
can now find the average phase modulation (f>k defined in Eq. (8.133). 
Figure 8.34 shows an example where sequences of M chips are used four 
times in succession to reconstruct a piecewise estimate of the message 



N = 21 elements, d - A/2, Nchips = 128 




Figure 8.32 Comparing the new SDMA receiver to a traditional linear 
array. 
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Figure 8.33 Correlation for five equally spaced arrival angles. 



signal mk(t). Figure 8.34 shows the original PM n%(t) and the estimate 
calculated for each quarter cycle. 

It can be seen that the predicted average phase for each quarter 
cycle can be fed into an FIR filter to reconstruct the original emitter 
modulation. 




Figure 8.34 Emitter modulation with receiver estimate. 
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In summary, the new SDMA receiver uses spreading sequences, as 
array weights, to shred the phase out of each array element. This 
approach provides a radical and novel alternative to traditional fixed 
beam and adaptive array approaches. The spreading sequences are 
composed of polyphase codes that have chip rates much greater than 
the anticipated emitter baseband modulation. The shredding momen- 
tarily randomizes the phase relationship between each element out- 
put. This in turn causes the pattern to be randomized during each chip 
of the spreading sequence. Over the M chip duration of the spreading 
sequence, the average pattern for the array approaches that of an omni- 
directional array. The shredded signal is reconstructed with a quadra- 
ture correlation receiver by correlating with memory signals. The mem- 
ory signals are created by a similar shredding process but for expected 
angles of arrival. Correlator outputs are generated for each expected 
AO A. The correlation magnitude is used as a discriminant to decide if a 
signal is present at the expected angle 0*. The correlation phase angle 
is a piecewise estimate of the original emitter’s modulation. 

This new approach has numerous advantages over traditional SDMA 
array antennas. The receiver is superior to a switched beam array be- 
cause contiguous beams can be formed in any regions of interest without 
the need for hardware phase shifters. The beams are created by correla- 
tion. The receiver memory can simply be altered to redirect the beams 
to new regions of interest if so desired. The elimination of phase shifters 
can result in a significant cost savings. 

The new receiver also has advantages over adaptive arrays because 
it does not require adaptation, it can process multiple angles of arrival 
simultaneously, and it is not limited by acquisition or tracking speeds. 
Interfering signals are minimized because the shredded waveforms at 
interferer angles do not correlate well with expected direction wave- 
forms stored in memory. 

Any arbitrary and/or random antenna array geometry can be incor- 
porated into this new approach as long as the expected memory signals 
are based upon the array geometry. The array resolution achieved is 
consistent with the limitations of the array geometry. 
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Problems 

8.1 For an M = 5-element array with d = X/ 2, the desired signal arrives at 
9 — 20°, one interferer arrives at —20°, while the other interferer arrives at 
+45°. The noise variance is cr 2 = .001. Use the Godara method outlined in 
Eq. (8.8) 

(a) What are the array weights? 

(b) Plot the magnitude of the weighted array pattern for —90° < 9 < 90°. 

8.2 Maximum SIR method: Given a M = 5-element array with spacing d. — .5/. 
and noise variance cr 2 — .001, a desired received signal arriving at 9q — 20°, and 
two inteferers arrive at angles 9\ = —30° and 62 = —45°, assume that the signal 

_ n • • • 0 

and interferer amplitudes are constant and that R ss = Ru = • ' ■ . ■ 

Lo • • • i 

(a) Use MATLAB to calculate SIR max 

(b) What are the normalized weights? 

(c) Plot the resulting pattern. 

8.3 Minimum Mean Square Error (MSE) method: For an M = 2-element array 
with elements a half-wavelength apart, find the optimum Wiener weights under 
the following conditions: The arriving signal of interest is s(f) = e J<ot arriving 
at the arbitrary angle of 9 S . The noise is zero-mean Gaussian with arbitrary 
variance a 2 . Allow the desired signal d(t) = sit). 

(a) What is the signal correlation matrix, R ss , in symbolic form? 

(b ) What is the array correlation matrix, R xx , in symbolic form? 

(c) What is the correlation vector r ? 

id) What is the correlation matrix inverse, R xx , in symbolic form? 

(e) Symbolically derive the equation for the weights. 

if) What are the exact weights for 9 S = 30° and for rx 2 — .1? 

8.4 Minimum Mean Square Error (MMSE) method: Let the array be a two 

element array where the arriving signal is coming in at 8 = 0° . Let the desired 
signal equal the arriving signal with s = 1. Thus, x = . Let d = X/2. 

(a) Derive the performance surface using Eq. (8.23) and plot for —4 < 
w 1, iv 2 < 4. 

(b) Derive the solutions for wl and w2 by using and 7 ^. 

(c) How is the derived solution consistent with the plot? (Recall that the 
V„,£[|£ 2 |] = 0 gives the bottom of the curve.) 

(d) Derive the performance surface using Eq. (8.23) for the arrival angle 
of 9 = 30° 

(e) Derive the solutions for w\ and W 21 by using and for the angle 
given in part d). 

8.5 MMSE method: Given an M = 5-element array with spacing d = .5/., 
a received signal energy S = 1 arriving at 9q — 30°, and two interferers 
arriving at angles 9\ — —20° and 92 = 40°, with noise variance cr 2 = .001. 
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Assume that the signal and interferer amplitudes are constant and that 

Rss = Rii = [5 5 ] • 

(a) Use MATLAB to calculate the optimum weights. 

(b) Plot the resulting pattern. 

8.6 Maximum likelihood (ML) method: Given an M = 5-element array with 
spacing d = .5/ which has a received signal arriving at the angle 9q — 45°, with 
noise variance cr 2 = .01. 

(a) Use MATLAB to calculate the optimum weights 

(b) Plot the magnitude of the weighted array pattern for —90° < 9 < 90°. 

8.7 Minimum variance (MV) method: Given an M = 5-element array with 
spacing d = . 5 A which has a received signal arriving at the angle 9q = 40°, one 
interferer arriving at —20°, and noise with a variance cr 2 — 001. 

(a) Use MATLAB to calculate the optimum weights 

(b) Plot the magnitude of the weighted array pattern for —90° < 9 < 90°. 

8.8 Least mean squares (LMS) method: For an M = 2-element array with 
elements a half-wavelength apart, find the LMS weights using Eqs. (8.48) and 
(8.56) under the following conditions: // = .5, <r 2 = 0, the arriving signal is 
s(t) = 1, the angle of arrival is 9 S = 45°, the desired signal d(t) = s(f). Set the 
initial array weights such that wi 1) = [ ] . There are no other arriving signals. 

(a) By hand calculate the array weights for the next three iterations (i.e., 
w( 2), w(3), iD(4)) 

(b) What is the error |fi(&)| for k = 2, 3, 4? 

(c) Use MATLAB to calculate the weights and error for 20 iterations. Plot 
the absolute value of each weight vs. iteration k on one plot and the 
absolute value of the error vs. iteration k on another plot. 

8.9 LMS method: Given an M = 8-element array with spacing d = .5/. which 

has a received signal arriving at the angle 9q — 40°, an interferer at 9\ = 
—20°. Assume that the desired received signal vector is defined by x s (k) = 
aos(k) where s(k) = sin(pi*t(k)/T); with T = 1 ms and t = (1 : lOOlT/lOO;. 
Assume the interfering signal vector is defined by xiik) = where i(k) — 

randn(l, 100);. Both signals are nearly orthogonal over the time interval T. Let 
the desired signal d{k) = s(k). Assume that the initial array weights are all 
zero. Allow for 100 iterations. Let step size n = .02. 

(a) Calculate the eight array weights for 100 iterations. 

(b) Plot the magnitude of the each weight vs. iteration number on the 
same plot. 

(c) Plot the mean square error |e| 2 vs. iteration number 

(d) What are the array weights at the 100th iteration? 

(e) Plot the magnitude of the weighted array pattern for —90° < 9 < 90°. 

8.10 Sample matrix inversion (SMI) method: Use the SMI method to find the 
weights for an M = 8-element array with spacing d — .5/.. Let the received 
signal arrive at the angle 9q — 45° . One interferer arrives at 9i = —45° . Assume 
that the desired received signal vector is defined by x s (k) = d-os(k) where s(k) = 




More Books Visit : www.iqbalkalmati.blogspot.com 



Smart Antennas 265 



sin (2 * pi* t(k)/T) with T = 2 ms. Let the block length be K = 50. Time is 
defined as t = (1 : K)*T/K. Assume the interfering signal vector is defined by 
Xi(k) = a\i{k) where i(k) = randnd, K) + randnll, K)j. Let the desired signal 
d(k) — s(k). In order to keep the correlation matrix inverse from becoming 
singular, add noise to the system with variance cr% = .01. Define the noise in 
MATLAB asn= randn( A T , K) * sqrt(sig2);. 

(a) Find the weights. 

(b) Plot the magnitude of the weighted array pattern for— 90° <9 < 90°. 

8.11 Repeat Prob. 8 for M = 5, d = .51, 9$ = 30°, 9\ = —20°, and allow the 
received signal to be a phase modulation such that s(k) = exp( ly'*.5*pi* sin 

(pi *t(k)/T)y„ 

8.12 Recursive least squares (RLS) method: Let the array be an M = 7- 
element array with spacing d = . 5 A. with a received signal arriving at the angle 
9 0 — 30°, an interferer at 9\ = —20°, and the other interferer at 02 = —40°. 
Use MATLAB to write an RLS routine to solve for the desired weights. As- 
sume that the desired received signal vector is defined by x s (k) = dos(k) where 
s(k) = expdj * sin(pi*t(k)/T); with T = 1 ms. Let there be K = 50 time sam- 
ples such that t = (0 : K— 1)*T/(K — 1). Assume that the interfering sig- 
nals il(k) — exp(lj*pi*randn(l, K)); i2(k) = exp(lj*pi*randn(l, K));. (MATLAB 
changes the random numbers each time so that d and i2 are different. Let the 
desired signal d(k) = s(k). In order to keep the correlation matrix inverse from 
becoming singular, add noise to the system with variance cr,j = .01. Initialize 
all array weights as zero. Set the forgetting factor a — .995. 

(a) Plot the magnitude of the first weight for all iterations. 

(b) Plot the magnitude of the weighted array pattern for —90° <9 < 90°. 

8.13 Constant modulus algorithm (CMA) method: Allow the same constant 
modulus signal to arrive at the receiver via a direct path and two additional 
multipaths and assume that the channel is frequency selective. Let the direct 
path arriving signal be defined as a 32-chip binary sequence where the chip 
values are ±1 and are sampled four times per chip (see Example 8.10). The 
direct path signal arrives at 30°. The first multipath signal arrives at 0° but is 
30 percent of the direct path in amplitude. The second multipath signal arrives 
at 20° but is 10 percent of the direct path in amplitude. Because of multipath, 
there will be slight time delays in the binary sequences causing dispersion. 
Let the dispersion be implemented by zero padding the signals. Zero pad the 
direct path by adding eight zeros to the back of the signal. Zero pad the first 
multipath signal with two zeros in front and two zeros in back. Zero pad the 
second multipath signal with four zeros up front. We will use the P = 1 CMA 
algorithm to define the optimum weights. Choose /x = .6, N = 6 elements, and 
d = X/2. Define the initial weights, w( 1), to be zero. Plot the resulting pattern 
using MATLAB. 

8.14 Least squares constant modulus algorithm (LS-CMA) method: Use the 
static version of the LS-CMA. Allow the same constant modulus signal to arrive 
at the receiver via a direct path and one additional multipath and assume that 
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the channel is frequency selective. Let the direct path arriving signal be defined 
as a 32-chip binary sequence where the chip values are ±1 and are sampled 
four times per chip. Let the block length K = 132. The direct path signal arrives 
at 30°. A multipath signal arrives at —30° but is 50 percent of the direct path 
in amplitude. Because of multipath, there will be slight time delays in the 
binary sequences causing dispersion. Let the dispersion be implemented by 
zero padding the signals. Zero pad the direct path by adding four zeros to the 
back of the signal. Zero pad the multipath signal with two zeros in front and two 
zeros in back. Use the LS-CMA algorithm to find the optimum weights. N = 9 
elements and d = X/2. Add noise for each element in the array and let the noise 
variance be o% = .01. Define the initial weights, w( 1), to be one. Iterate three 
times. Plot the resulting pattern using MATLAB. 

8.15 Repeat Prob. 14 but use the dynamic LS-CMA algorithm. Let the block 
length be if = 22. Allow the block to update for every iteration n. Stop af- 
ter 5 iterations using 6 (6 • K = 132) blocks. Plot the resulting pattern using 
MATLAB. 

8.16 Conjugate gradient method (CGM): For the M — 9-element array with 
elements a half-wavelength apart, find the optimum weights under the follow- 
ing conditions: The arriving signal of interest is s = sin(7rk/K); arriving at an 
angle of 30°. One interfering signal is defined as 7j = randnd, K); arriving 
at —20°. The other interfering signal is defined as 1-2 = randnfl, K); arriving at 
—45°. The noise has a variance of o% = .001. Thus, n = cr*randn( 1, K);. Use the 
CGM to find the optimum weights when using a block size K = 20. 

(а) Plot the norm of the residual for all iterations. 

(б) Plot the resulting pattern using MATLAB. 
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